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Interference for Data Transmission 
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(Manuscript received April 22, 1981) 


In this paper, we analyze a technique for accurately detecting 
transmitted data symbols contained in a modulated signal that has 
been degraded by a linear dispersive channel and additive Gaussian 
noise. The approach uses an adaptive equalizer which provides 
preliminary decisions to an adaptive canceller. The canceller output 
is used to remove the interference from an adaptive matching filter, 
resulting in the desired signal. Channel equalization attempts to 
invert the channel transfer function, while avoiding excessive noise 
enhancement. However, cancellation (as used in echo cancellers), 
attempts to generate a replica of the interfering signal and subtract 
it from the actual received signal containing the sum of the desired 
signal and interference. The cancellation approach, unlike equali- 
zation, offers the possibility of removing interference without enhanc- 
ing the level of noise already present in the received waveform. 
Simulation results for transmission over practical channels show 
significant improvement of linear cancellation over both linear for- 
ward and decision-feedback equalization. 


|. INTRODUCTION 


For the past twenty years, engineers have been seeking new tech- 
niques to combat the intersymbol interference (IsI) in data transmis- 
sion over band-limited channels. Adaptive equalization with the mean- 
square algorithm has been the major technique that allowed a sub- 
stantial increase in attainable transmission rate.’ If the channel has 
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only phase distortion, then the linear fractionally spaced equalizer can 
eliminate virtually all of the 1st without enhancing the noise level.”* 
However, when amplitude distortion is present in the channel, any 
adaptive linear equalizer (LE) must compromise between inverting the 
channel transfer function and avoiding excessive noise enhancement. 
Inevitably, some noise enhancement occurs. Decision-feedback equal- 
ization can offer somewhat improved performance when amplitude 
distortion is present.*” By using the Viterbi algorithm,® maximum- 
likelihood receivers, in principle, offer the best performance possible 
but depend on adaptive estimators of the channel and require an 
impractically high complexity when the channel impulse response is 
long, as in the case of the typical telephone channel. 

If there were no IsI, the probability of error in detecting the trans- 
mitted data level (i.e. +1 or +3) would be the same as if only one such 
pulse were transmitted in isolation. In that case, the optimal receiver 
(for Gaussian noise) would be a matched filter and would yield a 
certain error probability, Po. When pulses are sent sequentially, the 
effect of IsI cannot be totally eliminated. The maximum likelihood 
estimator of the entire sequence of transmitted symbols is known to 
result in an error probability that is somewhat larger than Pp. 

In this paper, we describe a cancellation technique designed to 
achieve isolated-pulse, matched-filter performance. Extensive simula- 
tion results confirm that there is a significant improvement over linear 
or decision-feedback equalization for severe amplitude-distorting chan- 
nels of practical interest. 


I. BACKGROUND AND MOTIVATION 


The idea of cancellation was used for the echo problem in two-wire 
telephony (see Ref. 7), where the received signal contains an interfer- 
ence component that is a filtered and delayed version of an “originat- 
ing” signal. In that application, the originating signal is actually 
available at the same location. However, in data transmission the 
originating signal—the transmitted data stream—is not directly ob- 
servable at the receiver. The idea of using preliminary decisions to 
generate an intermediate estimate of the transmitted data signal was 
independently proposed by various investigators. All of the proposals 
included adapting the coefficients of a filter that forms a replica of the 
1st. Adaptation is, of course, needed since the appropriate filtering 
operation is not known in advance and can vary with time. Hirsch and 
Proakis proposed the cancellation scheme with the essential structure 
in Figure 1.°° Here the canceller attempts to remove the Is! directly 
from the received line signal. This approach does not achieve improved 
performance over linear equalization if there is phase distortion. In 
this paper, we describe a linear canceller (LC) structure where a 
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Fig. 1—First generation linear canceller. 


transversal filter W is used instead of the delay in Fig. 1. Both C and 
W are adapted simultaneously with the error signal between the input 
to the final detector and the appropriate reference. 

The motivation for this structure stems from the need to effectively 
detect high-speed data on channels that have both a high noise level 
and substantial amplitude (slope) and phase distortion. In certain 
conditions of practical interest, even if the equalizer were of infinite 
length, the noise enhancement of linear or decision feedback equali- 
zation makes it impossible to achieve the required error rate. Never-— 
theless, linear equalization is sufficient to obtain fairly moderate error 
rates so that the detected symbols may be adequate as preliminary 
decisions for a cancellation scheme. 

In Section 3, we provide intuitive reasoning that the optimal choice 
for W is a matched filter, and the optimal C is a canceller whose tap 
weights are the samples of the channel autocorrelation function, except 
for the center tap, which has zero weight. We show in Section 4 that 
this is true under the assumption that the preliminary decisions A, are 
correct. Section 5 covers adaptive operation and Section 6, simulation 
results. 


- Il. FORMULATION 


Let the transmitted data symbols be denoted Ai, Ag, Az, --- with 
each complex valued symbol having real and imaginary parts restricted 
to one of a finite set of values (i.e. +1, +3). A complex-valued pulse 
shape p(t) is used to generate the baseband transmitted data signal 
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s(t) = » A; p(t — kT). (1) 


The linear distortion of the channel results in the received waveform 


X(t) = }) Anh(t — kT) + Vit), (2) 
k 


where V(t) is white noise and A(t) is the overall channel impulse 
response. 

Suppose initially that the receiver consists of a matched filter and a 
sampler (rate 1/T) followed by a symbol detector as shown in Fig. 2. 
The matched filter has impulse response 


W(t) =h*(LT — t), (3) 


where * denotes complex conjugation and the integer L is chosen large 
enough so that the output is small for t < 0. The output of the sampler 
is then 


U(mT) = > Aar(mT — kT — LT) + V'(mT), (4) 
k 


where V'(é) is the colored noise at the matched filter output and r(t) 
is the autocorrelation function of the pulse A(t); that is, r(t) = 
“0 A(s)g(s + t)ds. Equation (4) can be written as 


U(mT) = Am-ur(0) + Vi(mT) + In_t, (5) 
where 
In-t= YY Asr(mT — kT - LT). (6) 
k#m-L 


Suppose that at time t = mT the receiver must detect the currently 
observable symbol A,,-, and it knows all prior symbols Am-_z-1, 
An-L-2,... and all subsequent symbols Am_—z+41, Am—L+2,..., Am that 
determine the value [,,-, of the total 1st. In this case, Jnm-z is a known 
constant and can be subtracted from U(mT). What remains is exactly 
the output value that the matched filter would produce if the trans- 
mitter sent a single isolated pulse An_1 p[t — (m — L)T]. Hence, the 
ideal performance, with error probability Po, would be achieved. 


TRANSMITTER 


CHANNEL 





Vit) 


Fig. 2—Model of transmitter, channel and matched filter (MF). 
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Fig. 3—Decision feedback equalizer. 


The above reasoning suggests that we could approach the ideal— 
isolated symbol—performance with each symbol decision if we could 
generate a good estimate of the total IsI, J,,-., at each sampling instant, 
t = mT. The decision feedback approach can be viewed as a partial 
step in this direction. This approach is based on the idea that we can 
estimate the prior symbols called precursors by storing and processing 
the outputs Am—z-1, Am-.-2,... already produced by the detector. The 
part of [,,-. determined by the precursors can then be constructed. By 
applying the decision A; to a feedback filter, the output is subtracted 
from U(mT) and the resultant signal is applied to the detector. The 
decision-feedback equalizer (DFE) is shown in Fig. 3. Since we have 
not removed all of the IsI, the resulting performance will be inferior to 
that of the isolated-pulse case. This discussion shows that the DFE 
technique can be regarded as a partial step towards the goal of totally 
removing ISI. We next examine how we go beyond the stage of 
postcursor cancellation to include precursor cancellation. 

Suppose at time mT we could also eliminate the subsequent, post- 
cursor, symbols A,,~1+1, Am-1+2,..., Am that also contribute to the total 
interference at time mT. Then, using eq. (6), an estimate of the total 
ISI, [,-1, would be available at time instant mT. This is not possible 
using the output of the detector in Fig. 2. However, suppose a separate 
equalizer operates on the received data signal y(t) as shown in (a) of 
Fig. 4. If optimally designed, it will have a modest error rate, and we 
can use its decision A,, as preliminary or tentative decisions for the 
purpose of constructing our estimate of [,,.. Now there is no problem 
in obtaining both precursor and postcursor estimates needed to form 
In-L. By introducing a fixed time delay, D, to the received signal prior 
to the matched filter as shown in (a), the LE has a head-start in 
estimating data symbols. A practical implementation of an adaptive 
passband canceller would take the form shown in (b) of Fig. 4. 

The delay D can actually be incorporated into the matched filter by 
choosing L suitably large. The cancellation filter C produces the 
estimate Imn_z of the actual interference 

Int = Y Aw(mT—kT- LT), (7) 


k¥m—-L 
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DELAY, D 


Fig. 4—(a) Linear canceller. (b) Passband linear canceller structure. 


(b) 
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where A, is the sequence of preliminary decisions. Note that the 
transversal filter C, which takes as input A,, and produces the output 
f,,-1 at time mT, has an impulse response 
_ jrti-(m—-L)T] t~Am-—L 
a= {ij on or (8) 
We shall see in Section IV that, under the assumption of perfect 
preliminary decisions, this is indeed the optimum impulse response in 
the mean-square sense. 


IV. OPTIMAL CANCELLATION 
4.1 Derivation of optimal filter coefficients 


To determine the optimal pair of filters W and C for the cancellation 
scheme, we make the simplifying assumption that the preliminary 
decisions available from the LE are correct. We focus on the structure 
shown in Fig. 5, where the filter W, called the matching filter, is a 
T/2-spaced infinite length transversal filter preceded by a sampler 
operating at rate 2/T samples per second. The filter C, called the 
canceller, is a T-shaped infinite length transversal filter. 

The matching filter W has input samples, y; = y(/T/2), where y(t) is 
the received line signal, and the output, U,, of W is taken at time 
instants, t = kT, as indicated in Fig. 5 by the 1/T rate sampler. The 
input to canceller C is the true data sequence {A,} since we have 
assumed the tentative decisions are correct. Thus, Fig. 5 shows C being 
fed directly by the transmitted data symbols. It is not necessary to 
explicitly consider the time delay D since we are allowing infinite 
length filters. The output of the canceller V,, is subtracted from the 
matching-filter output. This difference producing U,, — V,, to be applied 
to a slicer may be viewed as a linear estimator of the data symbol Ap. 
The goal is to determine the filters W and C that minimize 


E, = E(\en|), (9) 


the mean-square error (mse), where €, = U;, — Vn — An. 






exp(j{Wot) —_ a(t) 


Fig. 5—Model of a linear canceller. 
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Let a(t) denote the additive receiver noise as shown in Fig. 5. Then, 
the input to the matching filter is given by 


yr = exp(JwolT/2) ¥\ A,AUIT/2 — vT) + a(lT/2) 


= exp(jwolT/2) b A,A(IT/2 — vT) + ot) | (10) 


where /(¢) is the complex impulse response of the channel and includes 
the effect of the transmitter shaping filter p(t) as treated in Section 
III. The term ai(-) is the complex baseband noise sample. All sum- 
mations are over the integers from — to ©, unless otherwise indicated. 
The output U,, of the matching filter is 


Un, = exp(—jwonT) ¥ Wi yon—as (11) 
P 


where W; denotes the tap weights of W. Let h; = A(/T/2) and define 
b= Wohi. _ (12) 


Then U,, may be written in the form 


Un = 5 Asbon—2s + Nn, (13) 


where 


Nr = » W3 den-s (14) 


is the noise at the output of the matching filter. 

We shall let C; denote the ith tap weight of the cancellation filter for 
each integer i, except 1 = 0. We make the constraint that the center 
tap weight of the cancellation filter is zero. This restricts the role of 
the canceller to removing IsI and prevents the canceller from making 
use of the current data symbol which must be estimated by the output 
signal from the matching filter. The canceller output V,, is given by 


Y=) C7 An-i, (15) 
i40 
and we assume that 
E|A,|? =], (16) 


which is a convenient normalization of the data symbol power level. 

To minimize the mse in eq. (9), we differentiate E,, with respect to 
the complex tap weights {C;,} and {W,} and set the derivatives to zero. 
Using eq. (15), it can be shown to yield 


2004 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


| E(ex An-m) =0 m#0, (17) 

and using eq. (11), 

exp(—jwonT) E(€2 Yon—m) = 0 all m. (18) 
Thus, these optimality conditions require that the error signal e be 
orthogonal to the observable inputs to the C and W filters, namely, 
{A,} and { yn}. 

We discuss two cases of interest. In Case 1, the channel is wideband. 
We assume that the W filter processes the line signal directly from the 
channel without prior band-limiting so that even when sampled at a 
rate of 2/7, the noise samples are uncorrelated. In Case 2, which is 
more relevant to our situation of a channel band-limited to the voice 
frequency range, the noise samples at T'/2 spacing are correlated. 

Case 1. Uncorrelated noise samples 

In Case 1, 


E[a(lT/2)a*(kT/2)] = oS, 
where o” is the noise variance. We also define a new term, 


R,(L) = ¥ A*(JT/2)A(JT/2 + 1T/2), (19) 
J 


which is the autocorrelation function of the T/2-sampled impulse 
response, h(j7'/2). Then, with 
T 
ht j-— 
(72) 


= R;,(0), (20) 


2 


E, = 


J 








we show in the Appendix that the matching filter has T/2-spaced tap 
weights 
h(—mT/2) 


Wr = exp(—jwomT/2) aig 


; all m, (21) 


which is clearly proportional to a matched-filter impulse response. The 
C taps are shown in the Appendix to be 
1 


Thus, the canceller impulse response, for m # 0, is that of the overall 
T-spaced impulse response of the channel and matching filter. 


Case 2. Correlated noise samples 


As described earlier, Case 2 corresponds to the voiceband telephone 
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channel, where the noise has approximately the same bandwidth as 
the signal so that noise samples at 7/2 spacing are correlated. 
The noise correlation is 


E[(akT/2)a*(lT/2)] = Rrlk — l). 


We define W(w) as the Fourier transform of the W tap weights, S,,(w) 
is the sampled noise spectrum, and H(w) is the Fourier transform of 


tha nhannal_ceamnlad imnulen vaannnen Than unth 
VLILS VLLALALIUL OGL LOL a2asputovu svopviroe. a aavuil, vW¥AUAL 


T (= |A(—w —wo) |? 


dn au S,,(—w) dw, (23) 


g= 
we show in the Appendix that 
H (—w —w») 
W(w) =——__———.. 24 
P= 0 8.—u) ee 
The corresponding Fourier transform of the canceller is 
1 |A(-w)|? 
ee eee RNC, Keay 2 
oe) 1+ €S,(— w + wo) 2) 
Note that since 


Ri (—2m) = Ri (2m), 


and S,(—w + wo) is real, we see from eqs. (22) and (25) that the 
optimum, infinitely long canceller sampled impulse response is Her- 
mitian symmetric about the center, i.e. 


Cn = Com. (26) 


4.2 Derivation of mse 


We now derive the mse achieved for Case 1, under the assumption 
that W and C have the optimal impulse responses given by eqs. (21) 
and (22), respectively. The matching filter output is given by 


1 
n= Bye ou AeRa2n — 2s) + Nn. (27) 


The noise, N,, is the result of applying white noise with variance o” to 
the matching filter eq. (65). Hence, using eqs. (66) and (67), we find 
that 


oE, 
— 
E|N,| “Th. + oF ae (28) 


Also, the canceller output is given by 





a x A;R,(2n — 2s). (29) 
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Hence, the error signal, with R,(0) = Ep, is 





En 
6, = U,, Ve — An = pan + Na An 

o 

=o (ate) An + Nn, (30) 
so that, using eq. (30), the mse is 
o 
= ee 

E, = Elen| Ea (31) 


The s/n, p, at the output of the cancellation system is defined as the 
ratio 


_£E|U,-—V,-N,|’ 


E(N,|° 32) 
Hence, we find that 
pares En 
= or (33) 


4.3 On the property of invariance of mse to timing phase 


The mse expression in eq. (31) can be written in terms of the 
channel-sampled power spectrum. If H(w) is the Fourier transform of 
the overall impulse response A(t), then the transform of h(jT/2) is 








a 2 Ark Qa 
ls) =F5 Hl oe SE ol <3 (34) 
and 
eels 
a H(i) 
eee T | 1 (w) |?d (35) 
ae es w) | “dw. 


Therefore, from eq. (31) the mse of the optimum Lc, normalized to 
unit signal power, is 


T (2 
= —, |7 "dw +1 
Exc 1/ | sae |» | A (cw) |?dw (36) 
If the matching filter were T-spaced, then we have 
Po 1 2rk 
H(w) ==> Al wo+— 37 
(w) Te (. T ) (37) 
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and 
mee |e | (w) | ) |?dw. (38) 
We can compare the results with similar expressions for the optimum 


LE and DFE in Ref. 5 which assume matched filters preceding the 
equalizers. They are 


= 2 ae eo 
and 
Eprr = exp {- = i T In[Y(w) + ide}, (40) 
Tv 
where 
1 Onk\ |” 








The expression in eq. (39) is the same as that for an infinitely long 
fractionally spaced LE,® whereas the result for a symbol-spaced equal- 


izer is 
_T = 4 oak 
pan z (2 [1/a[3 (e+ =) 


Finally, we note that if the overall channel-impulse response has less 
than 100 percent rolloff and the matching filter is T’'/2-spaced, then 


+ 1 fa (42) 








Ale) = ZH), 


and 
T (% 
En = — ie | H(w) |?dw. 
Aor = 


Clearly, Exc is independent of any phase characteristics and, hence, 
the canceller performance is insensitive to timing phase. This property 
is shared by the fractionally spaced LE”* and the symbol-spaced LE 
and DFE which are preceded by matched filters, as exhibited by eqs. 
(39) and (40). On the other hand, by itself, the symbol-spaced LE is 
sensitive to timing phase, as seen in eq. (42), because the integrand is 
a function of the folded spectrum of the channel. This is because a 
symbol-spaced filter cannot properly synthesize a proper matched 
filter. Likewise, the LC with a T-spaced matching filter is sensitive to 
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timing phase, although some simulations have indicated that the effect 
of a bad timing phase seems to be less on the Lc than on the LE. 


V. ADAPTIVE CANCELLER STRUCTURE 


In practice, we have finite length filters and the ensemble averages 
described above are not available to the receiver. As in the case of the 
adaptive LE, we adjust the complex {C;,} taps of the canceller and 
{ W;} taps of the matching filter to minimize the instantaneous squared 
error, 


|e.|? = |Un — Vn — An|?, (43) 


at each time instant nT. As shown in (b) of Fig. 4, U, and V,, are the 
outputs of the matching filter and canceller, respectively, where 


E=1 
Un = exp[—j(wonT + 6;,) ] > Wr 2n-hky (44) 
-L 
and 
M A 
Viz So CAs, (45) 
k=—-M 
#0 


where we have 2L complex W taps at 7/2 spacing and 2M complex C 
taps at T' spacing. Unlike the ideal case in Fig. 5, A,-, are the tentative 
decisions obtained from the LE, and 90, is the estimate of any phase 
jitter present. The value of A, in eq. (48) is that of the ideal reference 
in the training mode and is the receiver’s output decision in the 
decision-directed mode of operation. Note that e, in eq. (48) is a linear 
function of the tap weights {C;, W;} so that it is possible in theory to 
jointly adapt the tap weights to achieve a unique mse. 

It can be shown that the er of |e,|* with respect to the tap 
weights are given by 


Sl en|* _ 
~LEnlin—-ky 46 
aC, 26% An» (46) 
and 
STO a5 [—j(wonT + 6) ]e# (47) 
aw, p J won n n Y2n-k- 


Thus, the adjustment algorithms are 


k=-M,.--,M 


hk <0, (48) 


Ci(n + 1) = Cp(n) + BetAn-e 
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and 
W,(n + 1) = W,(n) _— Bexp[—j(wonT + 6.) ex Y2n—-ky 
k=L,---,L—1. (49) 


The step size 8 is chosen to obtain reasonably fast tap convergence 
without the algorithm becoming unstable. The problem is similar to 
that addressed in Ref. 10 for es ta equalizers where it is proved 


dled thin athwm neon wmannsla tA catia: ms ARM abun 
LLIAL LIT SUCp 5120 nEEGS tO SariSr Ly. the CUOLISUL ALLL 


0< ay Ae TS 
o<rp 

where L is the total number of taps, (2L + 2M) in our case, and (x”) 

is the average input signal power. The fastest initial convergence is 

achieved with 


1 
= : 50 
e L (x’) Se 





Vi. COMPUTER SIMULATION 


The performance of the system over three channel models is evalu- 
ated. Channel 1 has a flat amplitude response over the entire voiceband 
frequency with little delay distortion, except near the lower band edge. 
Channel 2 has moderate amplitude and delay distortion, which just 
meets the private-line conditioning, and Channel 3 has severe ampli- 
tude distortion. These characteristics are shown in Fig. 6. Pseudo- 
random digital data at levels +1 and +3 are used to modulate the 
quadrature amplitude modulation (QAM) transmitter at a rate of 9.6 
kb/s and the resulting signal is sent over one of the channels. Additive 
Gaussian noise and, whenever desired, phase jitter are introduced into 
the channel. The input signal-to-channel noise ratio (s/n;) will be 
taken to be either 30, 24, or 20 dB. 

In the simulations, the tc filters {C;,} and {W,} are allowed to start 
adapting after 1000 iterations to ensure that the LE has converged 
sufficiently and is able to provide good tentative decisions for the Lc. 
The performance of the Lc is measured by the output s/n 
(s/n,). This is defined as the ratio of the average data symbol power to 
the output noise power, which is taken to be the time average of the 
squared error at the Lc output. We exhibit plots of s/n, as a function 
of time, expressed in number of data symbol intervals, for both the LE 
and uc. Scatter plots of the output signal constellation are also pro- 
vided. In the absence of any impairments, the plot would show 16 
points with the x and y coordinates at +1, +3. 

In the simulations, both tc filters span 31 data symbols so that the 
matching filter has 62 7/2 taps, while the canceller has 31 taps. The 
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Fig. 6—Channel characteristics. (a) Channel 1. (b) Channel 2. (c) Channel 3. 


LE has 64 7/2 taps to sufficiently span the channel-impulse response. 
An initial step size of 0.0005 is chosen for updating all the tap coeffi- 
cients and it is reduced to 0.00005, when the taps have nearly con- 
verged, to get a small final mse. 

We first compare the performance of the Lc and LE over the three 
channels described earlier using an s/n; of 24 dB. The receiver switches 
from the ideal reference to the decision directed mode of operation 
after 3000 iterations. Figures 7, 8, and 9 show the receiver s/n, as a 
function of time for the three channels. 

Note that in the absence of significant distortion, as in the case of 
the Channel 1, the LE and Lc perform equally well, and we see that 
s/n, is approximately s/n;. However, the LE degrades more than the 
Lc when the channel is more severely distorted in amplitude. 

Table I summarizes the results of several simulation runs, for the 
LE, DFE, and Lc, over Channels 2 and 3 and with various s/n;. Results 
for Channel 1 are not presented because all three schemes do not 
suffer any significant degradation. These results are based on the use 
of an ideal carrier demodulating phase. Note that for Channel 2, the 
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Fig. 7—Performance of LE and Lc over Channel 1. 
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Fig. 8—Performance of Lc and LE over Channel 2. 
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Fig. 9—Performance of Lc and LE over Channel 3. 


Lc is able to attain s/n.’s which are very close to s/n,, for all three 
cases. The gain is about 2 dB over the LE and about 1 dB over the 
DFE. If we can use the rule of thumb that every 1-db gain corresponds 
to an order of magnitude reduction in error rate, then the Lc would 
have an error rate two. orders of magnitude less than the LE and one 
order less than the DFE. The results for Channel 3, again, show the 
improvement over both LE and DFE. We see that the LE performance 
degrades significantly when the channel has considerable slope distor- 
tion, and on the average, it suffers losses of about 4 dB from the input 
s/n. The Lc shows improvements of about 3 db over the LE and 1 dB 
over the DFE in all three cases. 

We conclude this section by comparing the scatter plots of the 
receiver outputs after processing by an LE and an Lc. In this example, 
Channel 3 is used and the s/n; is 24 dB. Figure 10.is the result of linear 


Table |—Comparison of linear equalizer, decision feedback 
equalizer, and canceller performances 


Channel 2 Channel 3 
s/n; (dB) 20 24 30 20 24 30 
Linear equalizer s/n, 18.0 22.1 268 154 204 £25. 
Decision feedback equalizer s/n, 19.1 23.0 279 17.8 22.0 27,1 
Canceller s/n, 19.7 23.8 29.2 185 23.1 28.7 
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equalization. There is a considerable point spread because of severe 
amplitude distortion. Figure 11 shows the result of cancellation, which 
considerably tightens the spread and provides a larger noise margin 
than equalization. 


Vil. CONCLUSION 


Cancellation is a powerful alternative approach to linear equalization 
that strives to mitigate the effects of slope distortion. This technique 
is based on cancelling the sidelobes of the overall channel-impulse 
response, unlike equalization, which attempts to invert the overall 
channel. 

Simulation results of QAM transmission at 9.6 kb/s have shown that 
the canceller performs impressively, especially for severely amplitude- 
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Fig. 10—Linear equalizer output signal constellation. 
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Fig. 11—Linear canceller output signal constellation. 


distorted channels, where neither the LE nor the DFE is satisfactory. 
The Lc needs an LE to provide it with reasonably good tentative 
decisions to perform the cancellation. It is then able to provide a final 
output which has a lower mse than the tentative output. In addition, 
the canceller is insensitive to phase distortion, provided the matching 
filter has an input sampling rate at least twice the data symbol rate. 
Hence, the Lc accommodates the task of a fractionally spaced LE as 
well. 

In summary, we have introduced a fundamentally new approach 
that performs better than either the linear or decision-feedback equal- 
ization, as confirmed by simulation results. Moreover, a recent theo- 
retical study” applicable to this cancellation method further confirmed 
the effectiveness of our approach. 
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APPENDIX 

We derive the optimum minimum mse Lc filters C and W under the 
assumption of (z) uncorrelated noise samples, and (iz) correlated noise 
samples. 

Case 1. Uncorrelated noise samples 

In Case 1 

E[a(lT/2)a*(kT/2)] = 0°8n, 


where o” is the noise variance. Consequently, we have, from eqs. (10), 
(11), and (15), 


E(UnAn-m) = ¥, Wiexp(JwokT/2) h*[(2m — k)T/2], (51) 
k 


E(VnAn-m) = Cm m #0, (52) 
E(y2n-hYan—m) = exp[ Jwo(k — m)T/2] Vy E{Aph*[(2n — k)T/2 
Pq 


— pT]+ ai [(2n — k)T/2]}} 
X {A,A[(2n — m)T/2 — qT] 
+ ai{(2n — m)T/2]} 

= exp[jwo(k — m)T/2] x h*(pT — kT/2) 


x h( pT — mT/2) + orm 
= Ri(k — m) + 0° Skm; (53) 
where 
exp (Jw) » h*(kT + T/2)A(RT + T/2 +7), lLodd 


Ri(r) = (54) 
exp(jwot) ¥ A*(kT)A(RT + 7) 1 even 
k 


and we also define another term, 
Ri(L) = ¥ h*(JT/2)hA(GT/2 + 1T/2), (55) 
J 


which will be used later. Equation (55) is the autocorrelation function 
of the 7'/2-sampled impulse response h(jT/2), whereas eq. (54) is T- 
spaced. So, from eq. (11) 


E(Un Yen-m) = exp(jwonT) ¥) WrE( y2n-2Yen-m) 
k 


= y W.Ri(k —m) + o°Wr exp jannT). (56) 
k 
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Also, 
E(A*-nyon-m) = exp[ jooo(2n — m)T/2] Y E(A* 2 Ap) 
P 


xX AL (2n — m)T/2 — pT] 
= exp[ Jwo(2n — m)T/2]h[(2k — m)T/2]. (57) 


Therefore, 
E(Viyon-m) = exp[jwo(2n — m)T/2] ¥, Cih[(2i — m)T/2], (58) 
tx¥40 


and 
E(Aiyen-m) = exp[jwo(2n — m)T/2]h(—mT/2). (59) 
By substituting eqs. (51) to (59) into eqs. (17) and (18), we have 
Cn = x W,exp(jwokT/2)h*[(2m — k)T/2], m # 0, (60) 


and 
Y WRK — m) + onWn = exp(—jwomT/2) 
k 


x ¥ CAL (2 — m)T/2] + exp(—jwomT/2)h(—mT/2), allm. (61) 
i=0 


We next solve for {C;} and {W;} from this pair of equations. Substi- 
tuting eq. (60) into eq. (61) gives 


YY WRE(R — m) + o2Wm = exp(—jwomT/2) YY We 
k 140 k 


X exp(jJwokT/2)h*[(21 — k)T/2] 
xX AL (2i — m)T/2] 

+ exp(—jwonT/2)h(—mT/2) 

» W,[exp jwo(k — m)T/2] 


x YA*[(2i — k)T/2] 


xX Al (2i — m)T/2] + exp(—jwomT/2) 
x h(—mT/2)(1 — ¥) Wiexp 
k 


X (jokT/2)h*(-kT/2)] 
= ¥ W,Ri(k — m) + exp(—jwomT/2) 
E 
xX h(-—mT/2)(1 — B), (62) 


INTERSYMBOL INTERFERENCE 2017 


using eq. (55), and where 











B=) Wiexp(jwokT/2)h*(—RT/2). (63) 
R 
Therefore, we obtain.the equality 
o Wm = exp(—jwoT/2)h(—mT/2)(1 — B), all m, (64) 
or 
= : (1 — B) 
Wan = exp(—jwomT/2)h(—mT/2) a (65) 
On substituting eq. (65) into eq. (63), we can show that 
E;, 
OO Ete oe 
where, using eq. (55), 
T 2 
J 
= R;(0). (67) 


From eqs. (66) and (65), we see that the matching filter has T/2-spaced 

tap weights 

h(-—mT/2) 
E, + a’ 


which is clearly proportional to a matched-filter impulse response. 


Wm = exp(—jwomT/2) all m, (68) 


Case 2. Correlated noise samples 


As described in Section IV, Case 2 corresponds to the voiceband 
telephone channel where the noise has approximately the same band- 
width as the signal so that noise samples at T/2 spacing are correlated. 

With noise correlation 


E[ (akT/2)a* (1T/2)] = Ri(k — 1), 
and eq. (53) becomes 
E(y2n-zYon-m) = Ri(k — m) + Ra(k — m). (69) 
Then, in place of eq. (56), we have 


E(Un yon-m) = >, WilRi(k — m) + R(k — m)]. (70) 
k 
Consequently, eq. (62) becomes 


> W.R»(k — m) = exp(—jwomT/2)h(—mT/2)(1— B), = allm. = (71) 
k 
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We have to solve a set of linear equations for the W taps, and the 
solution in the time domain is not obvious. Instead, we examine the 
results in the frequency domain. Transforming both sides of eq. (70) 
gives 


W(w)Sn(—w) = (1 — B)H(—w —wo); 
that is, 
Ww) = (1 — 8) (—w —wo)/S,(—w), (72) 


where W(w) is the Fourier transform of the W tap weights, S,(w) is 
the sampled noise spectrum, and H(w) is the Fourier transform of the 
channel-sampled impulse response. Again, W(w) takes the form of a 
filter matched to the channel with additive band-limited noise. 
The constant 8 can be written as 
2n 


a T_ A*(-w —wo) W(w)dw (73) 


On substituting eq. (72) into eq. (73), we can solve for B as 


ou F 
1+ é& 
where 
_T (FZ |A(-o -«)| 
~ Sn | “Seay A 
T 
Consequently, from eq. (72), 
Ww) = H(—w —«») (75) 


(1 + £)S,(-w) 


Even though the C and W filters are jointly adapting, the C taps 
are, in fact, slaved to the W taps. On substituting eq. (68) into eq. (60), 
we obtain, using eq. (55), 


1 r m= 
Cm = Te h(—kT/2)h*[(2m — k)T/2] 


Thus, the canceller-impulse response (for m # 0) is that of the overall 
T-spaced impulse response of the channel and matching filter; that is, 
the canceller recreates the entire ISI component present in the match- 
ing filter output signal as long as the correct data symbols are applied 
to the canceller. 
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The C taps corresponding to eq. (75) can similarly be obtained. From 
eq. (71), we have 


Al (2m — k)T/2] 


1 
ors I 
On substituting into eq. (60), we have 
C,, = SPUanl) Ss wewtRt(am+1—k), m#0. (78) 
(1-6) ky 


If, in addition, we define 


Pec W.WiR*(1— k), 
oe eae ey ene) 


we can transform the sequence {C,,} to obtain 
= ¥ C,,exp(—jwmT) 
ee ¥ Wrexp[—j(w — wo)kT/2] 
Gp. 020" 
x ¥ Wrexp[j(w — wo)lT/2] 
I 


x Y RA(Q2m +1—k) 


X exp[—J(w — wo) (2m + 1 — k)T/2] (79) 


Since both indices k and / run from —* to ©, we can replace the 
summation over m as 


>; Ri(p)exp[—j(w — wo)pT/2] = Sn (— w + wo) 


= S,(— w + wo). 
Therefore, eq. (79) can be simplified to 


C(w) = —s | W(w — wo) |7S,(— w& + wo). (80) 


Substituting for W(w) from eq. (72) gives 
(1 — B)|H(-w) |? 


OSS 


(81) 


or, from eq. (74), 


1 |A(-w)|? 
Co) = 


1+ oP wW + wo) (82) 
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A unified theory is presented for data-aided equalization of digital 
data signals passed through noisy linear dispersive channels. The 
theory assumes that some past and/or future transmitted data sym- 
bols are perfectly detected. We use this hypothesis to derive the 
minimum mean-square error receiver. The optimum structure consists 
of a matched filter in cascade with a transversal filter combined with 
a linear intersymbol interference canceler which uses the ideally 
detected data symbols. The main result is an expression for the 
optimized mean-square error as a function of the number and loca- 
tion of the canceler coefficients, the s/n, and the channel transfer 
function. When the number of canceler coefficients is zero, we get the 
well-known result for linear equalization. When the causal or post- 
cursor canceler approaches infinite length, we obtain the well-known 
decision feedback result. When both the precursor and postcursor 
cancelers become infinite, we obtain the very best result possible, 
namely, the matched-filter bound dictated from fundamental theoret- 
ical considerations. Neither the decision feedback nor the matched- 
filter results can be achieved in practice since their implementation 
requires infinite memory and storage. Our theory can be used to 
calculate the rate of approach to these ideals with finite cancelers. 


1. INTRODUCTION 


The theory of linear and decision feedback equalization to mitigate 
the effects of intersymbol interference (IsI) and noise in digital data 
transmission is well known.'~ In this paper, the problem of equalization 
is cast in a general framework of an IsI canceler aided by past and/or 
future data values. This general structure is suggested from optimal 
detection theory and is shown in Fig. 1. The optimal detector of digital 
data in the presence of additive Gaussian noise and ISI is comprised of 
a matched filter and an IsI estimator which is used to cancel the 
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Fig. 1—Block diagram of data-aided equalization. 


interference.”*® The implementation of this structure is often impract- 
ical because of its complexity.’® 

In our theory we postulate that some portion of the Ist can be 
perfectly synthesized and, therefore, subtracted from the incoming 
signal. In other words, we replace the optimal estimator with a practical 
one. The effect of the remaining interference is then minimized by a 
linear filter or a conventional linear equalizer. In practical systems, 
however, perfect estimation cannot be achieved; therefore, our results 
serve as ideal limits. The inclusion of occasional errors in our theory 
has proved mathematically intractable so far. 

In Section I], we determine the minimal mean-square error (mse) 
when an arbitrary set of data symbols is known to the receiver. In 
Section III, the optimal receiving filter is derived and analyzed. The 
performance of the infinite linear equalizer, the decision feedback 
equalizer and the infinite canceler are obtained as special cases of the 
general result. Section IV covers a discussion of numerical results. 


ll. MINIMUM MSE FOR DATA-AIDED EQUALIZATION 


In Fig. 1, the transmitter generates the data sequence {a,} whose 
elements are assumed to be independent identically distributed (i.i.d.) 
discrete random variables. These discrete amplitudes sequentially 
modulate the pulse p(t) at a rate 1/T to produce the transmitted 
signal. The pulse shape, p(t), can be viewed as the overall impulse 
response of the transmitting filter and the transmission channel. White 
noise, »(f), is added to the received signal which is then applied to the 
linear receiving filter, w(t). The output signal is sampled at the symbol 
rate 1/T and combined with the output of the canceler. The linear 
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canceler is modeled as a transversal filter with coefficients, {c,}, where 
n € S, and where the set of integers S denotes the range of the 
canceler’s taps. 

For most applications, and for the special cases investigated later in 
this Section, the range will contain the neighboring taps of the refer- 
ence location but never the reference location itself, and consequently, 
S = {-M, ---, —1, 1, ---, Neo}. The canceler operates on the past 
received symbols an-1, +++ , @n—-n, and on the future received symbols 
GQn+1, ***, Qn+n,, Which are assumed to be known to the receiver. 
Clearly, to realize an operation on the future data symbols, a time 
delay of at least N,T seconds has to be introduced. 

For a general set S the output signal x, = x(nT) can, thus, be written 
as 


Xn = YS rQn-r— DY CeOn-e + &n. (1) 
k keS 


=—00 


Where r; = r(kT) is the overall impulse response evaluated at ¢ = kT, 
ry = “al w(t) p(kT — 7)d7, (2) 


and where & = & (RT), i.e. 


& = | w(r)v(kT — r)dr, for R= —o,---,-1,0,1,---,. (8) 


—oo 


To facilitate modeling of various types of linear modulation schemes, 
the data sequence, the noise, and all impulse responses are assumed to 
be complex valued. In general, p(t) will be the preenvelope of the 
passband transmission system with respect to a carrier frequency. This 
notation has become extremely useful and economical in this field.? 
Specifically, it permits a unified presentation of baseband and pass- 
band systems. 

The output signal, x,, after slicing or quantizing is usually taken to 
be an estimate of the transmitted data symbol a,. Our goal now is to 
obtain a receiving filter, w(t), and canceler taps, {c,}, so that the mse, 


€= E{|xn — an|*}, (4) 


is a minimum. To determine the optimal canceller coefficients, {cn}, 
we differentiate eq. (4) with respect to cn, n € S, and set the result to 
zero 


re) 
=~ = oilct — rit + cn — rn] = 0, for nES, (5) 


where 
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E {anak} = 0276p, (6) 


and 6,,, is the Kronecker delta. The immediate conclusion from eq. (5) 
is that forn ES 


Cn =Tn. (7) 


Inserting this into eq. (1), we get 
Xn = D rrQn-n + { w(t)v(nT — r)dr. (8) 
kgS ae 


Thus far, our approach is perfectly obvious. By knowing the data 
symbols for all integers k € S, it is possible to synthesize the resulting 
ISI associated with these symbols and subtract it from the current 
signal sample x,. If the set S contains all integers k < n, we use all the 
already-decided-upon data symbols (available at the receiver without 
delay) to synthesize the postcursor IsI. This is precisely what is done 
in decision feedback equalization. If the set S contains all the integers, 
except the one associated with the present instant, 7, all Is! is elimi- 
nated. But this, of course, requires infinite delay. In practice, the set S 
will be finite and our main concern will be to determine how it 
influences the mse. 

We now proceed to optimize the receiving filter, w(t), for a given set 
S. Inserting eq. (8) into eq. (4) and using eq. (6), the resulting mse can 
be expressed as 


cmoi| ¥ Inl?-n-re ti 4e'], (9) 
kéS 
where 
E {v(t)v(t + r)*} = 0? T S(t), (10) 
and where 
, o {~ ; 
o = Ts |w(t)|“dt . (11) 


We remark that more general noise covariances can be included, but 
the calculations become more cumbersome without yielding additional 
insights. 
To obtain the optimum w(t), let 
w(t) = wo(t) + Au(t) (12) 


and define 
U, = r| Wo(T) p(RT — r)dr, (13) 
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where wo(t) is the optimum impulse response of the receiving filter 
and where the U; are the samples of the optimized overall impulse 
response. It follows immediately from eq. (2) that 


rp, = Uz + ar | u(r) p(RT — t)dr. (14) 
When 
OE 
aA =0 (15) 
A=0 





is calculated from eq. (9), we obtain an equation for the optimum 
receiving filter: 


Wo(t)No = p(—t)* — ¥! Unp(kT — t)*, (16) 
ke 
where 
2 
No=—. (17) 
Oa 


The interpretation of eq. (16) is standard: the optimum receiving filter 
is comprised of a matched filter p(—t)* in cascade with a transversal 
filter having taps only at those locations where the canceler has none. 
This structure is shown in Fig. 2. 

To obtain our central result, an expression for the optimal mse, we 
multiply eq. (16) by Two(t)* and integrate from —% to +, This yields 
with the aid of eqs. (9), (11), and (13) the result 

€opt = o2(1 = Uo) . (18) 


The explicit determination of Up is the subject of the next section. 


* 
p(—t) 





Fig. 2—Optimal receiving filter, wo(t). 
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Ill. THE OPTIMAL FILTER 


To determine the sample values, {U,,}, of the optimal overall 
impulse response, define the autocorrelation function of the channel 
impulse response as 


i; = r| p(mT — 7) p(—7)*dr (19) 
and its Fourier transform as 
R(w)= YX Rnewir?, (20) 


After multiplying eq. (16) by Tp(nT — +) and integrating from —o to 
+o, we get the following system of linear equations in {Un}, 


UnNo = Rn —- Y, UrRm-x for all m. (21) 
kgs 
To determine the optimal receiving filter, we only need to know U,, 


for m € S. From eq. (21) we extract the equations necessary to 
determine Up and partition them as follows 


Uo(No + Ro) = Ro — » U,.R-, for m=0, (22) 
hed 
Yo UrMm-r = (1— Uo)Rn for m€d, (23) 
ke J 


where we defined 
M,= Rit Nodzo ; (24) 


and where J is the set S augmented by m = 0. 

Note that the indices of the unknowns and the indices of the right- 
hand sides of eq. (23) have gaps of the same size and at the same 
locations. (See the definition of J above.) Thus, the set of equations in 
(23) is not in a standard form and the solution technique is not obvious. 
In Appendix A we develop a technique to solve this infinite set of 
equations with finite gaps. It involves the solution of a special infinite 
set of equations without gaps. To compensate for the gaps, we augment 
the original set of equations. Specifically, we add for m € J a finite 
number of equations to the infinite set such that the solution vanishes 
for m € J. From Appendix A, we determine that the optimum mse 
becomes 


No 


Not He’ (25) 


ee ee 
€opt = 0a 
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where Hp is determined from the following set of equations: 


by Mis nEm = 62,0 _ NoM;, for ke J (26) 


med 
and where M;,’ is the inverted sequence of M,, i.e., it satisfies 


¥ MnMiin =o,0 for allk. (27) 


In the following section, we investigate the minimal mse for some 
special cases. As mentioned initially, we use the realistic assumption 
that the set S contains the neighboring locations {—Ni, ---, —1, 
1,---, No}. Then J = {—M, -++ , No} and the coefficient-matrix in eq. 
(26) is a finite Toeplitz matrix. The solution of eq. (26) and, thus, Ho 
is unique and is guaranteed to exist when R(w) + No is bounded away 
from zero and infinity."° These conditions are very mild and are 
satisfied in most cases of practical interest. 


3.1 Infinite length equalizer 


For N, = Nz = 0 the set J includes only the zero integer and all 
canceler coefficients vanish. Consequently, eq. (26) degenerates to a 
single equation 


H)Mo! = 1 — NoMo". (28) 


Solving this for Ho and inserting it into eq. (25), we obtain the standard 
result for the optimum linear equalizer,” 


T | wT Nodw 


as RE CIE (29) 
2Qr _ n/t tw) + No 


€opt = 


where we expressed Mo" in terms of its Fourier transform 


T ("dw 
Mi=—| ————_. 30 
. =| ae ee 


3.2 Matched-filter 


When JN, = N2 = ~ the set J is infinite; i.e., the canceler subtracts 
all the 1s1. Equation (26) in this case yields 


>> Mi. n(Hn + No6m,o) = dx,0 for allk. (31) 


Comparing this with eq. (27) gives the result 
M,, = Hn + Nodmo < (32) 
From eqs.(32) and (23) it follows that H, = Rm for all m. Thus, the 
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optimum mse for this case is 
2__No 
=. 
No + Ro 
This is recognized as the matched-filter bound for the optimal detec- 
tion of a known signal in noise and in the absence of Is1. 


(33) 


€opt = 


3.3 One-sided canceler of infinite length 


For N,; = 0 and Ne = & the canceler performs as an ideal decision 
feedback equalizer of infinite length. In Appendix B the mse is derived 
for the more general case N, # 0, i.e. for a decision feedback equalizer 
with a limited number of noncausal taps. The result is 

No 


€opt = o%y——— ’ (34) 


1 
y |Me|? 
k=0 
where the coefficients Mj are determined from the following equation 


Ys MiMn--=M, for allm. (35) 
k=0 
Here, {M7} is the causal “root” of the two sided sequence {M;}. It 
satisfies Mz = 0 for k < 0 and Mj = (M7_,)* for k = 0. It is shown in 
Ref. 3 that 


n/T 
_ |Mé|? = No exp El In (= < 7 ao], (36) 
0 


—2/T 





and when this formula is inserted into eq. (34) we get the well-known 
result for the decision feedback equalizer, 


T(™  (R) 
eta? =, 


—a/T 





Unfortunately, there is no similar simple expression for |Mi|’, for 
k # 0; therefore, we are forced to numerically factor the two-sided 
sequence {/M,;} into its causal and anticausal root. 


IV. DISCUSSION OF NUMERICAL RESULTS 


In this section, the minimal mse of data-aided equalization is eval- 
uated numerically for certain channels and for various sets of canceler 
taps. We will exhibit and discuss the behavior of the mse, €op:(Ni, N2), 
as a function of N; and N2 for typical telephone channels. As a point 
of reference, note the following easily proved inequalities: 


Eopt (0, 0) = Eopt(0, 00) = Eopt(%, 00), 
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In the following, we examine three types of cancelers: 

(t) Starting from the infinite length linear equalizer whose mse is 
Eopt(0, 0), we increase the number of causal canceler taps; i.e., we 
examine €op:(0, N2) for Ne = Q=0,--- , 15. 

(iz) Starting from the infinite length linear equalizer, we increase 
the number of known data symbols alternating between causal and 
noncausal ones; i.e., we examine €o(Ni, No) for Q = Ni + No = 
0, --- , 15 and where N, = Ne for Q even, No = N, + 1 for Q odd. 

(tit) Starting from the infinite length decision feedback equalizer 

whose mse is €op¢(0, 0), we examine the behavior when noncausal taps 
are added; 1.e., €opt(Ni, ©) for Ni = Q=0, ---, 15. 
Equation (25) is used to determine the mse for cases (i) and (ii), where 
Hh is obtained as the solution of eq. (26). To determine the sequence 
{M;,"} for all k, we observe that the Fourier transform of the sequence 
{R;} isrelated to the overall transfer function of the channel as follows: 


on\ |* 

P(« k =) 

Clearly, R (w) is periodic with period 27/T, and it is only dependent on 
the magnitude of the overall channel transfer function, P(w). There- 
fore, phase distortion in the channel has no effect on the mse. This is 
reflected in the well-known fact that phase distortion can be perfectly 
equalized without noise enhancement. Therefore, the sequence, 
{M;,'}, is obtained as follows 


foe] 


R(w)= Y 


k=-00 








(38) 


M;, = 


a/T jkoT 
T | ee?" dw (39) 


2a _a/t Et (@) + No’ 


where j = V—1. Fast Fourier transform techniques are used to evaluate 
eq. (39). Numerical tests show that it suffices to take 64-128 samples 
of R (w) + No in the interval [—7/T, 7/T']. The fact that the coefficient 
matrix in eq. (26) is positive definite and Toeplitz, makes it possible to 
obtain the desired solution, Ho, recursively. This is done with the 
Levinson algorithm.’ 

For case (iii), we evaluate eq. (34). The sequence, {M;}, is obtained 
from the following approach.’ First determine {F;,} for all k such that 


In(M(w)) = y Fre 7*°7, (40) 
k=—00 


Then it follows that 


M(w)* = exp {3 rem, (41) 
=0 
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and 
T a/T 
Mi = = M(w)*eT dw . (42) 


—n/T 


Fast Fourier transforms are used to obtain consecutively {F,}, M(w), 
and M;*. The overall channel power transfer function, |P(w)|’, is 
assumed to consist of a raised cosine shaped transmitting filter with 
relative excess bandwidth, a = 0.15, 


| T(w)|? 
1 for |w| < (1 — a)a/T 


. a\ T T Ci 
05 |1~sin(w- 2) 2 for (1 — a) F< < (1 +a) = 


. r\ T 7 7 
o5[1+5in (+5) 2 | for — (1 + a) .<o<—(1— 2) 7, 


elsewhere 


and cascaded with the channel power transfer function |G(w) |?. 

Figure 3 shows two different channel power transfer functions, 
|G(w)|?, which are used to derive the subsequent numerical results. In 
(a) we show the equivalent baseband transfer function for the worst 
channel meeting the basic conditions of private lines (BASICBAD).” 
Part (b) shows a transfer function (CABLE) with linearly increasing 
attenuation. The parameters P; and P2 indicate the attenuation at 
w® = —7/T and w = 7/T. A model for a baseband cable channel is 
obtained when P; = P». 

Figure 4 shows the mse as a function of the number of canceler taps 
for the various channel transfer functions and for s/n of 20 dB at the 
receiver input. The dotted line represents type (i) canceler; the dashed 
line, type (ii); and the solid line, type (iii). The curves for types (7) 
and (ii) start at the minimal mse for the infinite length equalizer and 
the curve for type (iii) starts at the minimal mse of the infinite decision 
feedback equalizer. 

As can be observed, all curves converge very rapidly to their asymp- 
totes. The curve for type (i) indicates that only 3 causal coefficients 
suffice to closely approximate the performance of an infinite decision 
feedback equalizer. The curve for type (ii) suggests that a total of 6 
coefficients (3 causal and 3 anticausal) results in a performance which 
is very close to the optimal (the matched-filter bound). The curve for 
type (iii) reaches very close to the mse obtained from the matched- 
filter bound with only 3 noncausal coefficients, in addition to an infinite 
decision feedback equalizer. These results are virtually independent of 
the channel involved. 
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|Gtw)]? (4B) 


0 


-1.0 —0.5 0 0.5 1.0 1.5 


(a) 


|evw)|” [dB] 





wT/ 7 
(b) 
Fig. 3—Power transfer functions. (a) BASICBAD channel; (b) CABLE channel. 


The channel, however, influences the best mse which is obtainable 
with the infinite equalizer, 1.e. €op:(0, 0), and with the infinite decision 
feedback equalizer, i.e. €op:(0, 0). Table I shows these figures for the 
various channels. 

The minimal mse obtained from the matched-filter bound is —20.04 
dB. Therefore, 1.8 to 4.7 dB can be gained for the channels considered 
if a canceler of three causal and three noncausal coefficients is included. 


APPENDIX A 
Solution of an Infinite Set of Equations with Finite Gaps 


Let 
M;, = Rr + Nodzo, (43) 
and consider 
YS UrMn-z = (1— Uo)Rn for mEéd, (44) 


kd 
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€ opt (Ny, V2) IN DECIBELS 


E opt (Ny, N2) IN DECIBELS 


(c) 


O CAUSAL TAPS, Nj =0, N2=Q0, FOR TYPE (/) CANCELER 


O CAUSAL AND NONCAUSAL TAPS, 
Q=N,+N2, FOR TYPE (ii) CANCELER 


A DECISION FEEDBACK WITH NONCAUSAL TAPS, 
N;, =Q, No =° FOR TYPE (iii) CANCELER 


€ opt(W1, Nz) IN DECIBELS 





Q COEFFICIENTS 


Fig. 4—Mean-square error for data-aided equalizations. (a) BASICBAD channel; (b) 
CABLE channel, P; = P, = —10 dB; (c) CABLE channel, P; = —10 dB, P2 = —20 dB. 
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Table I—Minimal mean-square error 


coo Decision 
Feedback 
oo Equalizer Equalizer 
Mean-Square Error op: (0, 0) in €opt (0, ©) in 


Channel dB dB 
BASICBAD —18.05 —19.16 
CABLE 
P, = 10, P2 = 10 —18.22 —19.13 
P, = 10, P. = 20 —15.35 —17.97 


where J is a finite set. It always contains the number zero but is 
otherwise arbitrary. Equation (44) is an infinite set of equations with 
a finite gap in both the indices of the unknowns U, and the right-hand 
sides Rm. Notice that eq. (44) reduces to a discrete convolution and, 
therefore, is easy to solve if k and m are allowed to take on all the 
integers, or if the gap could be removed somehow. 

Now consider instead of eq. (44) the following set of equations 


YD VeMmn-2z = (1 —- Uo)(Rm— Hn) for allm, (45) 
k=—0o 


which is a discrete convolution. In order that eq. (45) conform to eq. 
(44), the auxiliary sequence {H,,} must satisfy 


H,=0 for m€éJd, (46) 
and 
Vn =O for med. (47) 


To accomplish this, the values of H,, for m € J are determined such 
that these constraints are forced to be satisfied. This is always possible 
since there are N, + N2 + 1 free parameters, H,,,, and the same number 
of conditions on V,,. From the above, it follows that 


U, = Vi for kéd. (48) 


This is easily proved by subtracting eq. (45) for m€ J from eq. (44). 
Now define the sequence ey z } such that 


y M,M;7.» = bm,0 5 (49) 


k=—0 


It can then be shown that 


Vi=(1— Uo) ¥ (Rn -Hn)Mi*n for allk. (50) 


m=—-o 
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Since V; = 0 for k € J, we conclude from eq. (50) that 
Y AnMim= Y RnMiim for RE, 


and since H,, = 0 for m€& J, it follows that 
Yo AnMim = 8:0 —-NoM;' for RE, (51) 
med 


where we used eq. (43) and eq. (49) for the right-hand side. Eq. (51) 
can be solved if the Toeplitz matrix generated by the sequence {M;,"} 
is not singular. This is always the case when M(w) = R(w) + No is 
bounded away from zero and infinity, i.e. for all systems of practical 
interest.’ 

For evaluation of the mse, we need 


¥ U,R_-, = > Vi.R-» , (52) 
keJ k 


where the equation holds because V; = 0 for k € J. Since J always 
contains the number zero, we conclude from eq. (43) that R_-, can be 
replaced by M_,. This yields together with eq. (52) and eq. (45) 


> Ur R-2 = (1 — Uo)(Ro — Ao) . (53) 
Aged 
We use eq. (53) together with eq. (22) to obtain 
Ap 
1° N+ He ve 
which finally leads to 
No 
ane 
Eopt Ca No + Ho ’ (55) 


the desired main result. 


APPENDIX B 
Analysis of the One-Sided Canceler of Infinite Length 


For N, finite and N2 = ©, the set of equations (24) which determines 
U; reads as follows: 


YS UsMm-2=(1—Uo)Rm for m<-—N,. (56) 


k<-N, 


To solve this one-sided convolution, we factor the sequence {/,,} into 
a causal part {M7} and an anticausal part {M7}; ie., 


Mn = Y MiMa-n, (57) 


n=0 
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where 


M;=0 for n<0 (58a) 
M,=0 for n>0. (58b) 
We now define a sequence Y, such that 
YS MiYmn-n = (1 — Uo) Rm forall m. (59) 
m=0 


Now insert eq. (57) into eq. (56) to obtain 
YS Un Y MiMa-n-n =(1—- Ud)Rm for m<—-N,. (60) 
k<—-Ny n=0 


In addition, substract eq. (60) from eq. (59) for m < —N, and obtain 
the following set of equations 


Yn= ¥ UpMn-» for m<-N,. (61) 


k<-N, 


Multiply eq. (61) by M7, sum over all m < —N,, and use eq. (57) on 
the right-hand side to obtain 


Y YrnMt,= Y UnM-,. (62) 
k<-N, ke—N, 
Recall that 
M), = Ri + Nodz00 , (63) 


and compare eqs. (57) and (59) which determines Y,, as 
Yn =(1-—- U.))Mn for m0. (64) 


Now insert eq. (64) into eq. (62) and make use of eq. (63) once more to 
obtain.the one-sided sum which is required in eq. (53): 


 ReUr=(1- Us) Yo MaM*n. (65) 


hk<-N, m<—N, 
Since M;, = M*,,, it can be shown that 


XY R-~Uz=(1-— Ud) ¥ | M;,|” 


k<-N, m<—N, 


=(1—Uo) ¥ [Mz|?. (66) 


m>N, 
Also, insert eq. (66) into eq. (53) and use eq. (63) to find 
No+ Ho=Mo— Y |Mi 2 (67) 


m>N, 
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With eq. (57) evaluated for m = 0, we finally obtain 
M 


No + Ho = > |M7,|?, (68) 


m=0 
and with eq. (55) we get our desired result, 
No 


€opt = of es . (69) 


y |Mn|? 


m=0 
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The general problem of equalization for data transmission where 
one of the two data sources produces continuous amplitude data 
samples is studied. There are various ways to configure a modem for 
such a transmission scheme, and we describe how a standard quad- 
rature amplitude modulation structure can be modified to operate in 
this mode. This solution can be specialized to include various linear 
modulation schemes, such as single sideband and vestigial sideband. 
Theoretical analysis shows that adaptive equalization and adaptive 
phase tracking can be achieved with similar quality as in the familiar 
digital-only modem. We provide extensive computer simulation re- 
sults which confirm the validity of our theory. 


l. INTRODUCTION 


Recently, considerable interest arose in finding ways to transmit and 
receive digital and analog data simultaneously over the 2-wire voice- 
band telephone channel. We investigate a system using quadrature 
amplitude modulation (QAM) where digital and analog data modulate 
the two quadrature channels. The same scheme was independently 
proposed in Ref. 1. The effects of various channel impairments, as well 
as an imbalance of signal powers between the digital and analog 
signals, on the error probability of the system has been studied in 
Ref. 2. 

In this paper, we analyze an adaptive equalizer for this type of 
hybrid modulation, using a cross-coupled transversal filter as described 
in detail by Falconer in Refs. 3 and 4. The difference here is that one 
of the two quadrature channels transmits analog, i.e. continuous am- 
plitude, data. If the communication channel is time invariant, it is 
possible to train the equalizer with digital data on both quadrature 
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channels and freeze the equalizer taps after convergence. Analog data 
can then be sent. However, possible changes in channel characteristics 
warrant the use of an adaptive equalizer to update the taps continually. 
Since references for the analog data are not available, especially when 
the receiver is in a decision-directed mode, the update algorithms 
reported in Refs. 3, 4 are not applicable in this case. We propose a 
modification which only minimizes the mean-square error (mse) in the 
digital data path. Thus, only the error signal from this path is required 
for adaption. This analysis is similar to that in Refs. 5 and 6. 

Results from computer simulations are given to verify our analytical 
results. We observed that because the analog data does not aid in the 
equalization, it actually acts as an interferer. As such, it would seem 
advantageous to reduce the analog signal power, thus, unbalancing the 
system. On the other hand, this would degrade the analog s/n. There- 
fore, there is a trade off in allocating different power levels to the two 
channels depending on which signal is more important. 

In the scheme described here, we assume that data symbols are sent 
every 7 seconds. Thus, the analog signal has to be limited in bandwidth 
to 1/(2T) in order to avoid aliasing. Alternatively, the two quadrature 
channels can be used to transmit primarily analog data with occasional 
digital data for purposes of equalizer updating. Then we could have 
one high-rate analog channel and a low data rate digital channel. 


ll. MATHEMATICAL MODEL 


The general QAM transmission scheme of Fig. 1 is considered. Two 
data sequences, {a,} and {b,}, are applied to the in-phase and quad- 
rature inputs of the cross-coupled transmission filters with impulse 
responses g,(t) and g,(t). Their output signals are modulated by sine 
and cosine waves of carrier frequency wo to form the passband signal 


{an} 






cos Wot 


—sin Wo t 


Fig. 1—Model of transmitter and channel. 
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s(t) = Re b DnaG(t — nTexp je ; (1) 


where T is the symbol interval, 
Dn = Qn + Jbn (2) 
and 
G(t) = gplt) + jgalt). (3) 


In the above equations, {D,,} is the complex sequence of data symbols 
and G(t) is the complex impulse response of the transmission filter. 
These parameters can be specialized to represent any linear modula- 
tion scheme,’ e.g., amplitude/phase modulation, single sideband (ssB), 
vestigial sideband (VSB), and QAM. 

Throughout this paper it will be assumed that at least one data 
sequence, for example, {a,} is digital. In particular, we will report 
results for a system with G(¢) real and where {a,} and {b,} are digital 
and analog data sequences, respectively. These sequences are assumed 
to have zero-mean and the following correlation properties: 


E {aQnQm} = Pabrm (4a) 
E {bnbm} = Podnm (4b) 
E{Gabn} = 0 | (4c) 
Bates i n=m af 
0, otherwise 


After passing through a noisy, dispersive, and phase-jittered channel, 
the signal at the input to the receiver can be expressed as 


q(t) = Re( expC ila + A(t) ]} x D,U(t - e)) + 7(b), 


where U(t) is the combined complex impulse response of the trans- 
mitting filter and the baseband component of the pass-band channel 
h,(t) with respect to the carrier frequency wo. The noise process 7(¢) is 
independent of the data sequences, while the phase shift 6(t) caused 
by the channel is assumed to vary much more slowly than the channel- 
impulse response, f(t), and is typically about 10 degrees peak-to-peak. 
It is mutually independent of the additive noise process, as well as of 
the data symbols. 

The general QAM receiver is shown in Fig. 2. The received signal q(t) 
is bandpass-filtered by the phase splitter pair with impulse response 
F(t) = f(t) + 7 F(t) where f(é) denotes the Hilbert transform of f(t). 
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ran 
exp(—/@y7—@n) 






QUANTIZER 


Fig. 2—Receiver structure. 


The pair of outputs of the phase splitter at time kT is written as the 
complex signal sample 
Xp = Xn + Xk 
= exp[ /(wokT + 9:) | > Di Hy-1 + Nz, (5) 
Ll 


where {H, = H(r + kT)} are the samples of the overall complex 
baseband equivalent impulse response, and {N;} are the complex 
samples of the filtered-noise process. The latter process is uncorrelated 
with the signal and has an autocorrelation Ryn. Thus, we have 


E{N,Nm} = Rnn[(n — m)T] (6a) 
E{N,Nm} = 0 (6b) 
E {a,Nm} = 0 (6c) 
E {brNm} = 0 (6d) 


for all integers n and m. 

The complex signal sequence {X;} is passed through a cross-coupled 
passband equalizer with 2M + 1 complex taps, the output of which at 
time nT is given by 


M 
Q, = Y CiXnxk 


k=—-M 
= C*X,, (7) 
where we write the complex tap as 
Cy = Cr + Jc, cr and ¢;, real 
and define the vectors 
C’ =([C_m, --- , Cu] (8) 
Xh = [Xnem, +++, Xn-mu]. (9) 
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We use the * to denote conjugation for scalars and conjugate trans- 
position for vectors and matrices. The symbol ¢ denotes transposition. 

The signal Q, is demodulated to baseband by multiplication with 
exp(—j6, — jwonT), where 6, is the estimated phase offset (or jitter) 
at time nT. The resulting signal Y,, can then be written as 


Yn = Q, exp(—jOn — jwonT) (10a) 


The demodulated outputs y, and y, are estimates of the transmitted 
data samples. In the following section, an optimum equalizer tap vector 
is derived which minimizes the mse of an appropriate cost function. 


lll. OPTIMIZATION OF EQUALIZER COEFFICIENTS 
3.1 Analysis of the minimum mse criterion for the in-phase branch 


In Refs. 3 and 4 an optimum equalizer that minimizes an mse 
criterion was derived. The mse was defined as . 


E{| Y, — D,|?] = E[Re?(Y, — Dn) + Im?(Y, —D,)] 
= E[(yn — Gn)?] + EL(¥n — bn)7], (11) 


which is the sum of the mse’s in both branches of the equalizer output. 
It was found that the optimum equalizer coefficients can be calculated 
adaptively provided the complex output error Y;, — D, is available to 
the receiver. While this is the case for a transmission system with 
digital data in both branches where references can be estimated easily, 
it is not for the system considered here. In this application only one 
reference sequence is assumed to be available. Consequently, only the 
error signal in this branch can be made available for updating purposes. 

In the following discussion, we define an optimum tap vector which 
minimizes the mse in that branch where a reference signal is available 
or can be estimated easily. The resulting tap vector will be compared 
with the result for the case where both references are available. 
Assuming we have a reference for {a,}, we define the mse in that 
branch as the cost function to be minimized 


én = E[(¥n — An)” ] (12) 
with 
Yn = Re[C*X, exp(—jO, — jwonT)]. (13) 
It is convenient to express y, in vector-matrix notation as follows: 
Yn = C'T(A8,)X,, (14) 


where we partitioned the complex vectors C of the passband equalizer 
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coefficients and X, exp(—j6, — jwonT) of the ideally demodulated 
received signal to get real vectors C and X, with twice the original 
dimension 


C’ = [Re(C’) |Im(C)] (15) 

X, = {Re[X, exp(—JOn — jwonT)] 
= |Im[X;. exp(—jOn — jwonT)]} (16) 
AOn = On — Bn. (17) 


In eq. (14), T(A@) is a transformation matrix expressing the effect of 
the demodulating phase error A@,, and is defined as 





(18) 
Note that this matrix is orthogonal; that is, 
T(a) X T'(a) = T(a) X T(a) = I. (18a) 
Furthermore, 
T(—a) = T*(a) (18b) 
and 
T(a + B) = T(a) x T(B) = T(B) X T(a). (18c) 


In order to get the mse eq. (12) as an explicit function of the coefficient 
vector C, we introduce the autocorrelation matrix A of the demodu- 
lated received signal 


A = E{X:Xn} (19) 


and the cross-correlation vector V between the demodulated received 
signal and the reference 


V = E(Xza,). (20) 
With eqs. (14), (19), and (20), we can express eq. (12) as follows: 
& = C’'T(AG,)AT(—AG,)C — 2C'T (AOn)V + E (az). (21) 


Setting the partial derivatives with respect to C to zero yields the 
vector equation for the optimum tap vector Copt 


AT (—A@,)Copt = V- (22) 
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The definition of A in eq. (19) ensures that it is positive definite. 
Consequently, the equation has a unique solution. 


Copt = T(A0,) AV. (23) 
Inserting eq. (23) into eq. (21) yields for the minimum mse 
espt = E (an) — Cop, T (An) V = E (an) — VA“V. (24) 


In Appendix A, we show that, for stationary data sequences {a,} and 
{b,} uncorrelated with the noise, the autocorrelation matrix A and the 
cross correlation vector V are independent of the time instant n. It is 
important to note that the minimum mse is independent of the 
constant demodulation phase error A@,. This is a consequence of eq. 
(23) and indicates that even by minimizing the mse in one of the two 
equalizer outputs, the optimum coefficient vectors can take care of any 
phase error, in the same manner as in a cross-coupled equalizer which 
minimizes the total mse at the output. 

These facts have been reported in Refs. 5 and 6 for vsB- and SSB- 
modulated pulse amplitude modulation signals. Our analysis shows, 
however, that this holds in general for stationary sequences {a,} and 
{b,}. Thus, the independence of the minimum mse on the demodula- 
tion phase is a property of the cross-coupled equalizer which is not 
adversely affected by the particular selection of the cost function nor 
by the nature of the sequence {b,}. 

When the power of the two data sequences is balanced, i.e., Pa = Ps, 
it can be seen from eq. (80) that the resulting equation for the optimum 
tap vector coincides exactly with the equation resulting from minimiz- 
ing the total mse in both branches. In this case, both methods will give 
the same optimal coefficient vector and the same total mse. 

In all our analysis we have assumed uncorrelated data as described 
by eqs. (4a) through (4c). If, instead of eq. (4b), we have 


E(bnbm) = Ri(n = m), 


then the expressions for Ai(R, /) and Ao(k, /) in Appendix A would be 
more complicated but they would remain stationary matrices. Then, 
assuming correlated data, eq. (24) is in general still valid, except that 
V and A are more complicated than the expressions derived in Appen- 
dix A. 

Although we have based our analysis on a symbol spaced equalizer, 
we can also handle fractional spacing and derive similar results. As an 
example, we can view the 7'/2 equalizer as two parallel symbol-spaced 
equalizers, where the first of the two data samples during each baud is 
processed by one of the equalizers while the second is processed by the 
other equalizer. Let us denote the two equalizer tap vectors asC, and 
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C2 and the input vectors as X,, and X n41/2. Then we can define 


C’ = [Re(C%) | Im(Ci)| Re(C2) | Im(C3)] 
Xn = {Re[Xn exp(—jOn — jwonT)]|Im[X» exp(—jOn — jwonT)] 
|Re[Xn+1/2 exp(—JOn — jwonT)] 
| Im[Xh41/2 exp(—jO, — jwonT)]} 


[ Ti(a) 0 7] 
T(a) = =}. , 
0 T;(a) 


where 7'(q) is given in eq. (18). With these definitions, all the previous 
results for Cop and €2,: in eqs. (23) and (24) follow. 


3.2 Mean square error in the quadrature branch 


We now analyzed the mse in the second branch of the equalizer. 
The output of this branch is 


Jn = Im[C*Xn exp(—j6, — jwonT)]. (25) 
Since 


Im{Z} = Re | Zexp (-7 =) |, (26) 


we are able to express ¥, in terms of C and X defined in eqs. (15) and 
(16) using the transformation properties in eq. (18) 


hy = CT (0, = 5) X,.. (27) 


Therefore, the mse in the second branch can be expressed in vector- 
matrix notation as follows: 


EX = Bl(Sn — Br)? (28) 
(ao, —*) ar (—a0, 42 
- 07 (as, 5) ar ( ad, +2) C 
-20'T (40 - 3) E(Xabn) + E(B3). (29) 


Using eq. (18) and an approach similar to eq. (66) through eq. (70) in 
Appendix A, we show that 


T (-3) E{Xibn} = V(P2/P2). (30) 
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On substituting eq. (30) into eq. (29) we obtain 
&=CT (a0, _ z) AT (as, i z) Cc 


— 2C'T (A0,)V a +P». (31) 


For the balanced power case, i.e., P. = P;, Appendix A shows that 


T (- z) AT (3) =A, (32) 


and it follows that 
€? = C'T(A0,,) AT (—A8@,,)C — 2C’T (AG,)V + Pa. (33) 
This is exactly the same expression as for €” in eq. (21). Consequently, 


é? = ¢’, (34) 


Thus, we conclude that the mse’s in both branches are equal. In this 
special case, minimizing the mse in one branch also minimizes the mse 
in the other branch. 


3.3 Analysis of an infinitely long equalizer 


While the formal solution eq. (24) for the minimum mse already 
shows the independence of a constant-phase error, it does not reveal 
anything about the influence of channel parameters (amplitude, or 
phase distortion) or of the sampling instant. To obtain further insight 
into this dependence, we analyze an infinite length equalizer. 

We show in Appendix B that the resulting minimum mse for an 
infinite tap equalizer can be written as 





fs BRS Pi[Z(w) + Z(-w)] + 1 
€opt = On |- 4PP,Z(w)Z(—w) dw, (35) 
+ (Pa + Ps)[Z(w) + Z(—w)] + 1 
where 
= Hig(w) : 
eye tay (36) 








In eq. (36) H.q(w) is the Fourier transform of the sampled impulse 
response H(t + kT), where 7 indicates the sampling instant. It is 
related to the transfer function H(w) as follows: 
> exp(jwt) = 2w : T 
Heq(w) = T yt (. + n=) exp (ian *) (37) 


R=—00 
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and | Neq(w) |? is the baseband component of the noise power spectrum 


| Neq(w) |? = 2 Ryn(RT) exp[—7 (w + wo)kT']. (38) 
The formula given in eq. (35) can be evaluated for all the different 
modulation schemes which can be modeled by a linear transmission 
system. The only frequency-dependent term appearing in eq. (35) is 
Z(w), the s/n of the sampled received signal. According to eq. (37) this 
will, in general, depend on the sampling instant, 7, and the phase 
characteristic of the overall channel transfer function, H(w). If the 
sampling theorem is satisfied, i.e., if H(w) = 0 for all w outside the 
interval [w:, wi + 27/T], where w is arbitrary, combining eqs. (37) and 
(36) shows that the minimum mse is only dependent on the amplitude 
characteristic of the channel transfer function and of the noise power 
density spectrum. A QAM transmission system with no excess band- 
width is an example; a system transmitting only one data sequence 
and using vSB-modulation with less than 100 percent excess bandwidth 
is another, more realistic example. In case of balanced power in the 
transmitted data sequences, i.e., Pa = P, or for Z(w) symmetric around 
o, Le., Z(w + wi) = Z(—w + w:), the mse is given by, 


TP a i re dw 


Be eee, (39) 
Qn} 2PaZ(w) + 1 


2 =_ 
Eopt | P,=P, ~ 


In eq. (100) of Appendix B we show that the partial derivative of €2p 
with respect to P; is nonnegative. Therefore, an increase in the analog 
signal power P; causes an increase in €3p:. So the analog signal acts as 
an interferer to the digital signal. 


IV. ANALYSIS OF THE GRADIENT ALGORITHM FOR JOINT 
EQUALIZATION AND PHASE TRACKING 


In an adaptive receiver, the equalizer is assumed to know the 
reference data for startup and to operate in a decision-directed mode 
when random data is being sent. In either case, the tap weight vector 
is being updated continuously. Similarly, the phase offset, or jitter, 
would be continuously tracked in order to remain in synchronism. As 
Falconer did, in Refs. 3 and 4, we assume gradient algorithms are used 
in these updatings as follows (C is now time-varying), 


Cai= f VCe2 (40a) 
n a 0 
bur = 6, - Sor 2, (40b) 
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where VCe? denotes the gradient of €, with respect to C. 
It can be shown from eq. (21) that 


VCen = 2T(A0,)[AT(A0n)Cr — V] (41a) 
and 
d 4 oT 
=z En = A GH =~ AG, AT’ Aé,, c, ie V . Alb 
a € 36, (A@,,)[AT"(A@,,) ] (41b) 


Using the definitions of A in eq. (19) and V in eq. (20) and substituting 
eq. (14) we can recast eqs. (41a) and (41b) into 





VCE; = 2E[T(A6n)Xn( yn ae: an) | (42a) 
de? aT (A8,,) 

Pht 2E Ch 9 9 cAe. p. e nn n . 42b 

aby, a | eo) 


Finally, we use another property of the transformation matrix T(a), 


namely, 
aT (a + B) 7 
———=Tlat+spr+- 
da (« B I. 


together with eq. (27) to obtain 


den . 

— = 2E[ yn(yn — Gn)]. (42c) 

d6n 
Equations (42a) and (42c) can be used to update the equalizer accord- 
ing to eq. (40). Note that all the signals required to update the equalizer 
are readily available at the receiver. Also note that yn — Qn = én is the 
error in the branch where a reference is available and y, is the output 
of the other branch. 

By taking expected values as shown in eqs. (42a and b), we obtain 
the estimated gradient algorithm. In practice, a stochastic gradient 
approach is used to avoid the long delay involved in estimating the 
averages. The update equations are obtained by omitting the expec- 
tations in eq. (42) and making small corrections in the direction of the 
instantaneous values instead. Hereafter, we shall discuss only the 
stochastic gradient method. 

Inserting eq. (42a) into eq. (40a) yields the update equation for the 
coefficient vector 


Chi = C,, = B’(yn re an) T (A6,,) Xn. (43) 


The corresponding equation for the update of the phase jitter corrector 
is 


v 


~ a’, 
On+1 = On 7 Gz nl yn ae ap). (44) 
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Division by a2 has been included in eq. (44) to give the corrections a 
smaller weight, if the nominal data value has a larger absolute value. 
Note that because of the special nature of T'(A@,) defined in eq. (18), 
the matrix multiplication in eq. (43) requires only 2(2M + 1) multipli- 
cation. Equation (43) can be reexpressed in terms of the complex 
equalizer C,, and input X, as 


Casi = Cn — B’Xal (Yn — An) exp(JnwoT + J4n)]*. (45) 


The equalizer coefficient and phase adjustment algorithms repre- 
sented by eqs. (44) and (45) are similar to the ones published in Ref. 
3. The main difference is that here, only the error in the digital data 
path appears in the adjustment algorithms. 

An important parameter in the evaluation of the dynamic behavior 
of the control loop is the rate of convergence (Roc). For the case of 
updating the equalizer only, the Roc can be analyzed using, for exam- 
ple, Ungerboeck’s method,® since the stochastic recurrence equations 
for the excess mse can be cast in the form analyzed in Ref. 8. Combining 
eq. (43) with eq. (14), it is seen that for the stochastic gradient case 


T (—A6,)Cn+i = [I — B’X,Xh]T(—-AG,)Cr + B’anX (46) 
From eqs. (21) and (24) the excess mse is given by 
€n — €opt = (Ca — Copt)’T (AOn)AT (—AAn) (Cn — Copt). (47) 
Therefore, 
€241 — Esp. = DZ AD,, (48) 
with 
D, = (I — B’X:Xn)T (—AG,) (Cr — Cope) 
— B’X»[XiT (—AG,)Copt — an]. (49) 


The convergence of the excess mse eq. (48) is analyzed in Ref. 8. There 
it is shown further that for stability the control loop constant has to 
satisfy 


’ So i. 
0S PSFK,’ 
with 
| (51) 
© E (KEK) 


The control loop constant, Bo, is chosen to give the fastest initial 
convergence. 
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In steady-state the excess mse is given by 


2H ge = B'E(X'X)e2zn 
eo Sop 2 — B’E (XX) 


From eq. (52) it can be seen that the excess mse can be reduced to an 
arbitrarily small value by selecting 8’ small enough. 

It is interesting to note that eq. (50) specifies a stability region for 
B’ equal to that for an equalizer using error signals from both branches 
[E (X’X) = E(X*X)]. From eq. (52) it follows that for a particular 2’ 
the ratio of the excess mse to €2, is the same as for an equalizer using 
error signals from both branches. 

We have not been able to analyze the roc for the joint operation of 
equalizer and phase jitter loops. In contrast to the equalizer update 
equation, the transformation matrix T'(A@,,) in the joint case is involved 
in a nonlinear manner. We, therefore, resort to computer simulation of 
the loop behavior. These results are reported in the next section. 


(52) 


V. SIMULATION RESULTS 


Here, we present simulation results for the hybrid modulation 
scheme described earlier. Pseudorandom digital data selected from the 
{+1, +3} alphabet is used to modulate the in-phase channel, while a 
set of pseudorandom numbers with a Gaussian distribution N(O, Ps) 
modulates the quadrature channel, where P, is the analog signal 
power. Additive Gaussian noise at —30 dB below the average signal 
level is introduced in the channel and, whenever desired, phase jitter 
is introduced. The latter process is modeled by a 60-Hz sinusoid of 10 
degrees peak-to-peak. The channels used in the simulation are (z) a 
good channel with a flat amplitude frequency response within most of 
the frequency band of interest and small delay distortion, except near 
the lower band edge, and (iz) a channel just violating the requirements 
for basic conditioning with both moderate amplitude and delay distor- 
tion. These characteristics are shown in Fig. 3. 

Although it is not intended to start up an equalizer with a reference 
signal only in one branch, we report results of such simulations. This 
gives good insight into the dynamic behavior of the adaptive equalizer 
update loop and facilitates the comparison with a conventional pass- 
band equalizer. 

The first run described is for balanced power (P. = P, = 5), where 
the receiver has a 64-tap T/2 complex equalizer and the error signal is 
derived from the in-phase digital channel alone. The basic channel 
described above is used. Figure 4 is a sample simulation run displaying 
the s/n as a function of time where s/n is defined as the ratio of the 
digital signal power to the digital mse. The latter is taken to be a 
weighted sum of past and present instantaneous squared errors. The 


DATA TRANSMISSION 2051 


j=) 


! 
_ 
oO 


DELAY IN 
MILLISECONDS 
Nn > 


10 log [H(f)| IN DECIBELS 


oO 
= 
NO 
w 
Oo 
_ 
No 
Ww 


FREQUENCY IN KILOHERTZ 


(a) (b) 


GOOD CHANNEL BASIC CHANNEL 


Fig. 3—Channel frequency responses. 


two curves are for two different timing phases in the receiver. As 
shown, the equalizer converges in about 2000 iterations for a step size 
of 8 = 0.0005. The step size giving the fastest convergence, according 
to eq. (53), would be By = 0.0015. A lower value is used in order to re- 
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Fig. 4—Convergence of hybrid equalizer. 
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duce the mse in steady state. The receiver’s digital and analog outputs 
after equalization and demodulation to baseband are displayed in Fig. 
-5 as a scatter plot. The vertical and horizontal axes represent analog 
and digital values respectively and, ideally, the data points would be 
on vertical lines passing through the X-axis at +1, +3. Owing to 
channel noise, there is both lateral and vertical displacement from the 
true values. The decision thresholds for the digital data are the vertical 
lines with abscissae 0 and +2. In this example, and all the others to be 
described, the input s/n is about 30 dB so that the equalizer has done 
a reasonable job in removing intersymbol interference (ISI) caused by 
the basic channel. 

We next exhibit the results for the regular QAM in Figs. 6 and 7, with 
digital data on both branches and using exactly the same channel and 
receiver parameters as before. The ideal constellation in the scatter 
plot would be 16 points with coordinates +1, +3 (in the absence of ISI 





Fig. 5—Hybrid receiver output constellation. 
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Fig. 6—Convergence of regular QAM equalizer. 


and channel noise). As seen in Fig. 6, the convergence is faster than for 
the hybrid modulation since here we used a complex error function, 
leading to a tap weight vector C , that fluctuates less than when using 
a one-sided error signal only. 

We mentioned in Section 3.1 that when the powers in the digital 
and analog branches are balanced, the optimum equalizer and mse are 
the same as that of a regular QAM receiver. Indeed, we see by comparing 
Figs. 4 and 6 that the digital mse’s reach the same levels. It was also 
observed that the significant tap weights for both receiver timing 
phases differ by less than 5 percent. 

Now we exhibit the effect on the mse of unbalancing the analog 
signal power from P; = 5 to some other value. The simulated receiver 
has an AGC that scales the received signal to an average power of “ten” 
and, hence, the equalizer tap adjustment step size is kept the same in 
this series of runs. The analog mse is obtained by averaging all the 
past instantaneous mse’s over time. Instead of presenting a series of 
curves, we summarize the results in Table I. The digital signal power 
is fixed at “five,” but the analog power is varied from “one” to “nine.” 
The output mse normalized to the power of the corresponding signal 
is presented in Table I for the good channel. Thus, we see that, as the 
analog power is increased, the digital mse increases, confirming the 
analytical results in Section 3.3 for the infinite length equalizer. It was 
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Fig. 7—Regular QAM receiver output constellation. 


also shown in Ref. 2 that the error probability upper bound is increased 
when a power imbalance exists in favor of the analog signal. As 
expected, we also see that the normalized analog mse is smaller with 
a larger analog signal power. We also note that the digital and analog 


Table |!—Comparison of analog 
and digital mse for different P, 
(good channel) 


Normalized mse 


Analog 
Power P; Digital 
1 0.0007 
3 0.0010 
5 0.0013 
7 0.0016 
9 0.0018 


Analog 
0.0040 


0.0009 
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mse’s are almost the same for balanced power, as theoretically pre- 
dicted in Section 3.2. 

Simulations were also performed to study the hybrid equalizer 
performance in the presence of sinusoidal phase jitter of 60 Hz, where 
the phase tracker modeled by eq. (44) is used to estimate the jitter 
process. Figure 8 shows a sample run where, after allowing the equalizer 
to reach steady-state, 60-Hz jitter with 10 degrees peak-to-peak am- 
plitude is introduced causing a degradation in performance. The plot 
in Fig. 9 shows the same run, except that the phase tracker is turned 
on shortly after the phase jitter is introduced. The lower the jitter 
frequency and amplitude, the more effective we can expect the phase 
tracker to be.* 


Vi. CONCLUSION 


A data transmission system capable of transmitting and receiving 
analog and digital data simultaneously has been studied in detail. We 
found that it is possible to perform adaptive equalization of the channel 
even when only one of the two quadrature channels is carrying digital 
data. Moreover, the minimum mse and tap-weight vector are un- 
changed from that of the regular QAM as long as the analog and digital 
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Fig. 8—Hybrid receiver performance in presence of phase jitter (no carrier recovery). 
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Fig. 9—Hybrid receiver performance in presence of phase jitter (carrier recovery, 
a = 0.5). 


signal powers are equal. However, start-up with simultaneous analog 
and digital data is slower by approximately a factor of two compared 
to the case of a conventional QAM system. An efficient start-up proce- 
dure might be to train the receiver with digital data on both channels 
and then switch one channel to analog data upon convergence of the 
equalizer, since in the case of balanced power, the equalizer taps are 
the same. We also found that the scheme can tolerate moderate 
amounts of phase jitter. 


APPENDIX A 
Analysis of A and V 
We define, 
Xpn = Re[X» exp(—jOn — jonT )] (53a) 
Xan = Im[Xn exp(—JOn — joonT J]. (53b) 
According to eq. (16), it follows that 
Xn = (Xpn|Xan)- (54) 
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Inserting eq. (54) into the definition eq. (19) of A yields 


t t 
ana oe) 9) 
This can be expressed as 
sinatra Raa) 
where 
Ai = E[(Xpn + jXqn)(Xpn — JXqn)'] = E(KnXn) (57) 
Ao = El (Xpn + JXgn)(Xpn + JXan)'] 
= E[X,X»n exp(—2j0 — 2jwonT)]. (58) 


We note that A» is symmetric and A, is Hermitian. For uncorrelated 
data and noise sequences the &, /th entry in the matrix A» is computed 
with eqs. (5), (6), (9), and (58) as follows 


Ao(k, l) = E[Xn-2Xn-1 exp(—276,, _ 2jwonT) | 
= 2 Mm E(D,D,) Hn-n-»An-i-n 
yp 


exp[ J(On-z + On-1 — 20n) — joo(k + DT] 
+ E(N,-rNn-1exp(—276, — 2jwonT )]. (59) 
From eqs. (2) and (4) we get 
E(D,D,) = (Pa — Po) 8. (60) 


With the assumption that the phase jitter is quasi-stationary over the 
equalizer length, i.e. @,-, = 6, for all k € [—M, M], we obtain 


Ao(k, 1) = (Pa — Po) Y HHyserexp[—joolk + 1)T]. (61) 


Note that Ao(R, 2) is zero, if the powers of the sequences {a,} and {6,} 
are equal, i.e., Pa = Po. 
Similarly the k, /th entry in A, is 


Ailk, 1) = E(Xn-2Xi_) 
= x » E(D,Di)Hn-%-»H n-1-p 
X exp[j(On-2 — On-1) + Jwo(l — k)T] 
+ E(Nn-zNr-i, (62) 
with 
E(D,D3) = (Pa + Ps)8yu; (63) 
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E(N,N;) = Ryn[(u — »)T, (64) 
and the assumption of quasi-stationary phase jitter, this finally yields 
Ailk, 1) = (Pa + Ps) ¥ HA eri 

X exp[ Jwo(l — k)T] + Run{(kR — DT]. (65) 
With the definition of V in eqs. (20) and (54), we find 


v= # (30) (65 
or 
with 

Vi = E[VpnQn] (68a) 
and 

Vo = E[Xgnan]. (68b) 


The &th entry in V; is calculated by inserting eqs. (5), (9), and (53a) 
into (68b) 


Vi(k) = E{a,Re[X,-,exp(—j6, — jwonT)]} 
= E{a,Refexp(j6n-» — JOn — JwokT ) x Di Hn-r-i]}. (69) 
Again, under the assumption of quasi-stationary phase jitter and with 
eqs. (2) and (4), we have 
Vi(k) = PaRe[H_,exp(—jwokT) J. (70) 
Following the same lines, one obtains for V2(&), 
V2(k) = Palm[H-,exp(—jwokT)]. (71) 


A transformed version of A is needed for the analysis of the mse in the 
second branch. According to eq. (18), the transformation matrix is 


(72) 
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Inserting this into eq. (56) yields 


7 w\ 1] Re(A; — Ag) | —Im(A, + Ag) 
1(- 5] ar(3) 7 ee — Az) | Re(A: + As) | — 


By comparing eq. (73) with eq. (56), it is found that 


Ci Ci ; 
n(-2)an(2)-a+a au 
with 
, _ | ~ReAe | —ImA2 
ae Ee ReAz | o 


In eq. 75, A’ is symmetric but not positive definite. Then, referring to 
eq. (61), we see that for the balanced power case, i.e., P2 = P, 


Px ween a — 
A’=0 and at | ar(5) A. 


APPENDIX B 
Optimum Tap Weight and mse of the Infinitely Long Equalizer 
Define 


Cope = (Ci | C2) (76) 


and use eqs. (56) and (67). Then, the equation for the optimum 
coefficient vector eq. (22) can be expressed as follows 


Re(A; + A2)C; — Im(A; — Az) Co = 2Vi 
Im(A; + A2g)Ci + Re(Ai — Az) Co = 2Vo, (77) 


where it is assumed that the demodulation phase error A@,, equals zero. 
(It has been stated that the minimum mmse is independent of A@,, at 
least for a constant phase offset. Therefore, no loss of generality results 
because of this simplification). 


Let 
C= (Ci + j€2) (78) 
and | 
V=(Vi+ JV). (79) 
This allows us to write eq. (77) in complex notation 
Ai€C + Ao C* = 2V. (80) 


Equation (80) can be expressed in the components of the vector V and 
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matrices A; and Az 


M 
[Ai(k, 2)C(2) + Aa(k, )C*(D)] = 2V(R). (81) 
M 


l=— 


Note from eq. (61) that since Ae is not a Toeplitz matrix, the sum in 
eq. (81) is not a convolution even if the dimension M approaches 
infinity. Define 


C(L) = C(Dexp(jwolT), (82) 
Vil) = ViDexp(jwolT), (83) 
Ai(k — 1) = Aik, Dexp[—jwo(l — k)T], 
= (P. + Po) x H, Hy +-1 


+ Ryn(k — Dexp[—jwo(l — k)T], (84) 
A2(k — 1) = Ao(k, Dexp[ jwo(k — DT], 
= (Pa — Ps) ¥ AH, Ayse-i (85) 


Equations (61) and (65) were used to get eqs. (84) and (85). With these 
definitions, we can transform the problem to baseband. Inserting eqs. 
(82) to (85) into eq. (81) yields 

M 


y [Auk — DED + Ad(k — DE*(D] = 202). (86) 


=—M 


Here the sums will be convolutions if M — o. Consequently, the 
equation can be expressed in the domain of Fourier transform as 


Ai(w) E(w) + Ao(w)E*(—w) = 2V(w). (87a) 


Together with the transform of the conjugate complex of eq. (86), 
which is 
Ai (—w)C*(—w) + AZ (—w)C(w) = 20*(-w), (87b) 
it can be used to find the formal solution 
= 2V(w) A*(—w) — 2V*(—w)A 
Coe o)Arf «) a ( ithe a (88) 
Ai(w) At (—w) — Ae(w) Az (—w) 


The Fourier transform of the sampled impulse response H(7 + kT), 
where 7 indicates the sampling instant, is defined as follows 


Heq(w) = y H(t + kT )exp(—jwkT), (89a) 


K=-a 
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where 
T a/T 
A(t + kT) = =| Heq(w)exp(jokT) dw. (89b) 
27 aoe 


Assuming the impulse response H(t) has the Fourier transform H(w), 
Poisson’s sum formula can be used to get the following relation 


between the spectra of the sampled and the continuous functions 
a exp(jwr) © k ; T 
H.q(w) = T > ao + 20 *) exo( J2ak *)) (90) 


k=—00 


Using the spectral density of the filtered noise in baseband 
| Neq(w) |? = Y Run(kT)exp[—j(w + wo)(kT)] (91) 
k= 


and eqs. (89a) and (89b), the transforms of A, Ao, and V can be shown 
to be 


Ai(w) = (Pa + Po) | Heq(—w) ? a | Neq(—w) I’, (92a) 
Ao(w) = (Pa — Ps)Heq(w) Heq(-), (92b) 
Vw) = PaHeq(—«). (92c) 


From eq. (24) and with eqs. (67), (76), and (78) the minimum mse can 
be expressed as 


€ ent = Py - ConrV =P,- R(C*V). (93) 


In eq. 93, C* V can be viewed as the zeroth term of the convolution of 

the sequences (V,) represented by V and (C*,) represented by C*. 

Multiplying the spectra and transforming back to time domain yields 
PEO A 

Re(C*V) = on 3 [C*(w) V(w) + C(—w) V*(—w) dw. (94) 


—a2/T 


Taking into account the modulation in eqs. (82) and (83), this also can 
be expressed in terms of C(w) and V(w), which are defined above. 


a/T 
Re(C*V) => | 5 (Cw) He) + C(—w)V(—w)]dw. (95) 
—n/T 


Combining eqs. (89a), (89b), (92a), (92b), (93), and (95) finally yields 
the desired expression of the minimum mse 


jo. 2 QPS | is Pi[Z(w) + Z(—w)] + 1 


co 2 | 4PaPsZ() X Zw) 
+ (Pa + Pr) X [Z(w) + Z(w)] + 1 


dw, (96) 
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where 











Heq(w) 
Z(w) = |—— 97 
(w) Nea(ca) (97) 
In the balanced power case P, = P:, we can simplify further to yield 
TP. (”" 1 1 
Bf a eg. 198 
cop On [ ip A2PaZ(o) +1] 22PaZ(—w) +1] 28) 
Since both terms are integrated over one full period this yields 
yo EP. de 
€opt = —— ——>—_—. (99) 
Qn J, 2PaZ(w) + 1 


It can be shown that the derivative of the minimum mse eq. (96) with 
respect to the power in the second branch is 

de2 TP? (" [Z(w) — Z(—w) Pde 
aP, 2a D? 








(100) 


—a/T 


where D is the denominator of the integrand in eq. (96). This means 
the derivative is positive unless Z(w) is symmetric in which case the 
former is zero. In general, a decrease in the power of the second branch 
‘will decrease the minimum mse in the first branch. 
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High-Speed Measurement and Control of Fiber- 
Coating Concentricity 


By D. H. SMITHGALL and R. E. FRAZEE 
(Manuscript received May 19, 1981) 


A technique has been implemented to measure and control the 
eccentricity of lightguide fiber in transparent polymer coating mate- 
rials. It is based upon a model which describes the characteristics of 
a forward-scattered light pattern generated by transversely illumi- 
nating coated fiber with a laser beam. The model predicts the behav- 
ior of the principal characteristics of the pattern as a function of fiber 
eccentricity within the coating. The implementation automatically 
detects and controls the position of the dominant pattern feature to 
maintain an average fiber-coating concentricity within 2 um over 
multikilometer lengths of fiber. 


l. INTRODUCTION 


The concentricity of an optical fiber in a plastic coating affects the 
fiber strength, transmission loss, and the connectorization process. 
Self-centering coating techniques, such as the flexible tip applicator,’ 
or hydrodynamically designed applicators,” have not been completely 
effective in maintaining the fiber centered in the coating. An alterna- 
tive to such passive fiber centering is to measure and control the 
location of the fiber in the coating. 

Kichenbaum has described a technique in which the coated fiber is 
illuminated by a laser beam (Fig. 1) and the symmetry of certain 
features of the forward scattering pattern are used to determine coating 
concentricity.? His model analyzed coatings whose refractive index is 
greater than that of the fiber, and eccentricities are normal to the 
direction of the laser beam. The effects of refractive index and relative 
fiber and coating diameters for the concentric case are also described. 
The model is limited, however, to fiber offset normal to the direction 
of incidence and does not consider the effect of the fiber core upon the 
forward-scattered pattern. 


2065 





Fig. 1—Forward-scattering pattern generated from laser-illuminated fiber. 


An alternate approach, first examined by Presby and subsequently 
described by Marcuse for silicone coatings, examines the backward- 
scattered pattern from a coated fiber illuminated by a laser beam.*° 
The structure of the pattern is more complex than a forward-scattered 
pattern because the scattered light passes through the coated-fiber 
structure twice. 

The technique of measuring fiber properties by forward-light scat- 
tering has been shown to be very powerful, and a similar approach 
could prove useful to automatically measure the fiber concentricity 
during the coating operation, and control the position of the fiber 
within the coating material.*° Of the two scattering techniques. de- 
scribed, the forward-scattering technique has the advantages that the 
pattern has a simpler structure, and as a practical matter, the forward- 
scattered pattern contains more light energy than the backward-scat- 
tered pattern, thus, requiring a smaller laser to provide sufficient 
information to a detector. In developing a centering control based 
upon forward-scattered light, the effects of both the fiber core and an 
arbitrary fiber eccentricity upon the structure of the scattering pattern 
must be considered. ; 


ll. CONCENTRIC FIBER RAY-SCATTERING MODEL 


The smoothed envelope of the intensity of the forward-scattered 
pattern from a graded-index, multimode fiber coated with an epoxy 
acrylate material is shown in Fig. 2. The pattern is typical for fiber 
with a 50-um core with 0.23 NA (numerical aperture), 125-~m clad 
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Fig. 2—Smoothed intensity of forward-scattered pattern for graded-index fiber in 
high-index coating. 


diameter, and 250-um coating diameter. The refractive indices of the 

cladding and coating: are 1.457 and 1.539, respectively, at 0.6328 um. 
The structure of this pattern can be analyzed by considering the 

three-level concentric model shown in Fig. 3, where the radii are 





Fig. 3—Three-level concentric model. 


FIBER COATING 2067 


normalized to the coating radius, and the refractive indices are assumed 
constant. The scattering angle @ for a ray striking the coating a distance 
x from the optical axis can be determined from the formula of Bouger,”® 
when the ray passes through the core, as x S n2 c, and 


9 = 2 (ao — a1 + 2 — a3 + 4 — Os), (1) 
where 
x = sin ao 
=n; Sin a) = nN) 2 SiN ae 
=n2 asin a3 = nC Sin ay 
= 3 C SiN as. (2) 


The exit angle, 6, for rays refracted through the cladding, nz c < 
x =n, a, is given by 


8 = 2 (ao — a1 + a2 — Qs), (3) 


where the angles are defined in eq. (2). 

In the case of a high-index coating, eq. (3) holds over the range 
Ng c= x< xn. a. In the range nz a S x < nj, G, rays are reflected from 
the surface of the fiber, and the exit angle is given by 


6 = 2 (ao — ai + G2 — 17/2). (4) 
The exit angle for rays refracted through the coating only is 
6 = 2 (ao — am). (5) 


For the parameters given above, the relationship between the scat- 
tering angle and the normalized incident ray position is shown in Fig. 
4 for x between 0 and 1. For x in the range 0 to —1, 6(—x) = —0@(x). 
Ray positions less than x, pass through the core of the fiber; rays 
between x, and x2 pass through the fiber clad and coating. Ray 
positions greater than x2 pass through the coating only. 

Over the angular range 9, to @,, there is an interference pattern 
generated by rays passing through both the core and cladding of the 
fiber. This interference is observed as an intensity maxima near +10° 
in Fig. 2. The structure of the interference pattern depends upon the 
core index gradient,’ and for typical near-parabolic index gradients, 
there exists a single intensity maximum whose location is nearer 4, 
than 9,. The angle 6. is independent of core structure, but depends 
upon core diameter. For single-mode fibers, @, — 0, and there will be 
no measurable intensity peak in the scattered-light pattern correspond- 
ing to the core-clad interface. 

The intensity maxima near +20° in Fig. 2 result from the focusing 
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_ Fig. 4—Ray-scattering pattern for step-index, multimode fiber with high-index coat- 
ing. 


of rays passing near the clad-coating interface. The location of these 
caustics in Fig. 4 is the angle 6,,, where d@/dx = 0. If the slope of the 
inverse scattering curve dx/d@ is interpreted as the relative energy 
density of the scattered light, then dx/d0 — © is a region of focused 
energy which will result in an intensity maximum. Conversely, the 
energy density of the forward-scattered pattern for angles greater than 
Om is very low. This is observed in Fig. 2 as an abrupt reduction in 
detected energy near the peak intensity at 6,,. 


iil. ECCENTRIC FIBER RAY-SCATTERING MODEL 


The influence of fiber eccentricity within the coating upon the 
forward-scattered pattern may be determined by examining its effect 
upon the position of the intensity maximum at 6,,. The maxima at @, 
may or may not exist, depending upon the structure of the fiber. The 
results in the preceding section show that for typical single or multi- 
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mode fibers, the effect of the core and the effect of the fiber/coating 
relationship upon the scattering pattern are distinct and separate. 
Therefore, in the subsequent development, a homogeneous fiber struc- 
ture will be assumed with no loss of generality. 

Consider the general nonconcentric structure of Fig. 5 in which the 
center of the fiber is displaced from the center of the coating by a 
distance d at an angle ¢ from incidence. The radii of the fiber and 
coating have been normalized to “a” and 1, respectively. The refractive 
indices are nz and m, respectively. Using Snell’s Law and the geometric 
construction of Ref. 5, expressions for the angles describing the ray 
path can be determined by modifying the equations describing back- 
scattered rays to describe the refraction of forward-scattered rays. 


ao = sin?! x. (6) 
1 
ai = sin”? (2 sin ca) (7) 
ni 
1, d ., 
Y= 7 sin a — — sin (ao — a + $). (8) 
a2 = sin y. (9) 
n 
y2= — sin a2. (10) 
nN2 
a3 = sin7! Y2- (11) 
a= 03. (12) 
a5 = Qe. (13) 





Fig. 5—Refracted ray in two-level structure with offset core. 
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Fig. 6—Internal reflected ray in two-level structure with offset core. 


ae = sin” {a sin a2 + d sin[ao — a1 + 2(az — a3) + ]}. (14) 
a7 = sin”! (n sin ae). (15) 


These equations represent an arbitrary ray passing through the 
coated fiber. Certain conditions, however, yield internally reflected 
rays or rays passing only through the coating material. Hence, special 
cases of eqs. (6) to (15) need to be considered in determining the exit 
angle (x). 

For a ray which passes through the fiber and coating, 


Y1, ye< 1, and (16) 
@ = (ao — a1) + 2 (a2 — az) + (a7 — ae). (17) 
For a ray which passes through the coating only, 
yi>i1, and (18) 
0 = 2 (ao — a). (19) 


For a ray which passes through the coating and is reflected from the 
fiber surface (Fig. 6), 


yi<il. 

y2>1. (20) 
Y3 = do — 01+ 6 + 2a2 — 7/2. (21) 
ag = sin”! (a sin a2 + d sin ys). (22) 
do = sin (n; sin ag). (23) 
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6 = ao + 2a + ag — A — 3 — 7. (24) 


There is one additional case occurring when nj sin ag > 1 or nm; sin 
ag > 1, which represents total internal reflection at the point where 
the ray should exit the coating. 

Equations (6) to (24) can be used to generate ray-scattering angles 
4(x). By appropriate choices of d and 4, scattering angles for arbitrary 
fiber eccentricity may be examined. 


IV. RESULTS FOR TWO-LEVEL MODEL 


For the purpose of investigating the effect of fiber eccentricity, 
consider the model wherein the ratio of core and outer diameter 
a = 0.5, and the refractive indices of the core and outer layers are 
1.457 and 1.539, respectively. These index values correspond to those 
of fused silica and an epoxy acrylate coating material. The effects of 
changing the material indices or the fiber/coating diameter ratio were 
examined in Ref. 3 for a concentric structure. 

The effect of offset “d” along the direction of incidence (¢ = 0), and 
normal to the direction of incidence (¢ = 7/2*) upon 6, is shown in 
Fig. 7. For each case, there is a 6,, corresponding to refraction of the 
light rays to both sides of the axis of the incident light. The angle 07, 
results from incident rays striking the fiber at normalized positions in 
the range 0 to +1, and the angle @;, results from incident rays striking 
the fiber for x between 0 and —1. There is a symmetry such that 67, 
and @;, exchange roles for ¢ = @ + 7. This is illustrated in Fig. 7 by 
showing the dependence of @,, upon negative d. 

For the case ¢ = 0, the loci of 97, and 0;, are symmetric about the 
vertical axis with increasing separation as the fiber is displaced away 
from the laser source. For d varying from —0.2 (d = 0.2 @¢= 7) tod 
= 0.2, 6, varies from 18 to 26 degrees. 

For the case ¢ = 7/2, the maxima 6;, and 6;, are symmetric about 
the axis only for d = 0. For increasing d at ¢ = 7/2, 07, increases and 
é;, decreases. Furthermore, the separation increases as d increases, 

-from 41 degrees at d = 0 to 54 degrees at d = 0.2. For ¢ = 37/2, the 
roles of 9;, and 6;, are reversed. 

At a fiber displacement of approximately d = 0.2, the ray generating 
67, is totally internally reflected at the point where it would exit the 
coating material, and the scattering-pattern feature corresponding to 
67, does not exist. 

For the more general case of an arbitrary ¢, define the separation of 


* In the subsequent discussion, the angle of the fiber offset is given in radians and the 
ray-scattering angle is given in degrees. 
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Fig. 7—Locus of intensity maxima 9, as a function of spatial offset for two angles of 
offset. : 


the scattering-pattern intensity maxima as . 
A = On — On. (25) 


It has been illustrated in the previous discussion that A as a function 
of both d and ¢ will provide an indication of the composite behavior of 
the scattering pattern. In addition, A is a factor in the measured 
eccentricity. 
On + On 
E= er ae. (26) 
Figure 8 shows a plot of A as a function of the offset orientation, ¢, 
with the offset magnitude, d, as a parameter. The pattern is cyclic in 
@ as expected. For d > 0.15, A— © near the orientation ¢ = +7/4. The 
intensity maxima 97, does not exist for d > 0.15 at this angular 
orientation. The fiber is sufficiently close to the surface of the coating 
that the cumulative refraction of the light rays remains monotonic 
with ray height, x, as in the case of the homogeneous medium. 
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Fig. 8—Separation (A) of intensity maxima as a function of spatial and angular offset 
with respect to incident laser beam. 


Therefore, there is no region in which scattered energy is concentrated 
for eccentricities greater than 15 percent of the coating radius. This 
condition limits the range over which a high-intensity feature could be 
used to detect fiber eccentricity. Subsequently, it will be shown that 
this is a minor restriction. 


V. OPTICAL DESIGN FOR COATING CONCENTRICITY MONITOR 


The model developed in Section III has shown that if the intensity 
maxima at 67, and 6m, or, correspondingly, the abrupt change in 
intensity at the edges of the pattern, could be detected, their position 
would provide a monotonic measure of the eccentricity of the fiber 
within the coating. By illuminating the coated fiber in two orthogonal 
directions, the degree of fiber eccentricity is uniquely established. The 
fiber may be centered within the coating by establishing symmetry 
within each of the two orthogonal patterns. 

The optical layout for the automatic detection system is shown in 
Fig. 9. The beam for a 1-mW HeNe laser is attenuated by a neutral 
density filter and split into two equal-intensity beams which are each 
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Fig: 9—Optics layout for automatic centering unit. 


reflected to intersect at the fiber. Located approximately four centi- 
meters behind the fiber is a viewing screen of white bond paper. The 
paper provides a high contrast background, and partially diffuses the 
scattering pattern, eliminating the fine structure corresponding to 
interference of refracted and reflected rays. The scattering pattern, as 
viewed on the screen, appears as a bright bar on either side of the very 
bright central spot (Fig. 10). At the end of each bar is a spot of slightly 
greater intensity corresponding to the outer peaks of Fig. 2. For 
multimode fiber, a second bright spot may be discerned between the 
central peak and the edge of the bright region. 

The scattering pattern is viewed by a closed circuit television (CcTV) 
camera through a 0.633-um interference filter. The filter permits the 
device to operate in normal room lighting conditions, while only the 
scattering pattern is observed. The camera is mounted such that the 
scattering pattern is crossed by a multiplicity of scan lines. Thus, along 
each scan line the intensity of the pattern is sampled once. By extract- 
ing this information from the ccTv output, the scattering pattern can 


(a) (b) 


@ INTENSITY REGIONS 


Fig. 10—Forward-scattering patterns as seen on viewing screen. (a) Fiber centered in 
coating. (b) Fiber eccentric in coating. 
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be reconstructed as in Fig. 2. Furthermore, the location of important 
features within the pattern can be determined by counting the number 
of samples, or scan lines, between features. With the full-field view of 
cctv, the detector is insensitive to tilt in the scattering pattern result- 
ing from an angularly-misaligned camera or fiber. 

To determine the centered position of the fiber within the coating, 
the forward-light-scattered pattern is made symmetric with respect to 
a central intensity maximum of the pattern, generated by the incident 
laser beam. If the coated fiber is not centered in the beam, a slight 
offset in the pattern with respect to the central peak exists, and when 
corrected by moving the coating applicator, results in an off-center 
coating. In addition, centering the coated fiber in the laser beam 
maximizes the energy transferred from the beam into the scattering 
pattern, and ensures that the two wings of the scattering pattern will 
have equal energy, maximizing the sensitivity of the measurement 
system. 

To align the laser beams to the coated fiber, a rotatable cube, 
mounted on the shaft of a small servo motor is located in each optical 
path. An aperture placed between the cube and the viewing screen 
eliminates spurious scattering effects from the corners of the cube. 

The optical components, screens, and CCTVs are mounted on a 
platform with the coating cup. The platform is adjustable with respect 
to a fixed base plate which, in turn, is attached to the fiber draw tower. 
Thus, the coated fiber structure is fixed with respect to the illuminating 
laser beams, and the fiber moves within the coating, limited by the 
orifice in the tip of the applicator. 


VI. VIDEO SIGNAL PROCESSING 


Two CCTV cameras are mounted such that each views one of the 
orthogonally mounted viewing screens and cuts the projected forward- 
scattering pattern with a plurality of horizontal scan lines. The camera 
outputs are treated independently of each other but via identical 
circuitry (Fig. 11). For each, the composite video signal is separated 
into the video and synchronization components. 

The video portion of the signal is input to two separate integrator 
circuits. One integrator sums the collective value of all the video pulses 
contained in one complete vertical field. Therefore, the output voltage 
is proportional to the total energy contained in the forward-scattered 
pattern. This signal, input to the microprocessor through an analog- 
to-digital (A/D) converter, is used to center the laser beam on the : 
coated fiber. The update rate of the signal strength measurement is 
equal to the vertical field scan rate (60 Hz). 

The second integrator sums the signal levels contained in each 
horizontal scan line. The output, controlled by line synchronization 
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Fig. 11—Video signal processing circuit. 
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pulses to form a serialized boxcar representation of the envelope, is 
input to a high-speed A/p converter which converts each scan line 
level to eight bits of binary data, which are input to the microprocessor. 
The video scan line signal is conditioned at the horizontal scan rate of 
approximately 63 ys. 


Vil. MEASURE OF FIBER ECCENTRICITY 


Once the laser beam has been centered on the coated fiber structure 
the envelope of the scattering pattern is examined to determine fiber 
eccentricity. Establishing a reference on the frame synchronization 
pulse, and subsequently, on each of the line synchronization pulses, 
230 consecutive samples are read from the A/D converter. Each sample 
represents the intensity of the scattering pattern detected during line 
scan. 

The data is scanned to locate the intensity changes which correspond 
to the edges of the laser beam and the forward-scattered pattern. From 
the scattering model it is known that an intensity maximum must 
occur near the locations at which the intensity goes to zero. 

Let {X;, 1 = 1, ---, nm} be the intensities of the samples {i}. A 
maxima is defined as: 


M;: Xi — Xi+1 = 0 
X; — Xi-1 > 0. (27) 


The two maxima corresponding the 67, and 6;, may be determined 
by searching the data near the locations of the edges of the pattern. 
From the relative locations of the central intensity maximum corre- 
sponding to the laser beam and the two maxima, the angles 67, and 
@;, can be determined and the eccentricity measure 


e=6n+ On (28) 


defined. Equation (28) represents a monotonic measure of fiber eccen- 
tricity, since it can take on both positive and negative values. Further- 
more, the function can be used directly to generate a corrective control 
signal to center the fiber in the coating. 


Vill. CONTROL OF FIBER POSITION 


The coating material, with a viscosity of ~50 poise at room temper- 
ature, presents a strong dampening effect on the motion of the fiber. 
Consequently, when the cut is moved with respect to the fiber, a 
settling time of several seconds is observed. Conversely, once the fiber 
is centered within the coating cup it will tend to remain centered such 
that readjustment is only occasionally required. It has been observed 
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that fifteen minutes may be required to reach this equilibrium state, 
with frequent position adjustments necessary prior to reaching equilib- 
rium, 

The algorithm implemented for fiber positioning is a deadband 
control, where for e given by eq. (28), the control u is: 


u=0 lsexl 
u = Uo e>1 
u = —Uo e<-l. (29) 


The value of uo is selected large enough to overcome friction and 
hysteresis in the servo motors and platform transport mechanism, yet, 
small enough to account for the slow dynamic response of the fiber 
moving through the coating material. The deadband about the point 
of pattern symmetry prevents the servo motor from constantly re- 
sponding to small disturbances resulting from short-term fluctuations 
in eccentricity, or single quantum steps in the computation of the 
eccentricity function [eq. (25)]. The normal system condition for a 
process in equilibrium is one of quiescence. 

For the typical case of an epoxy acrylate-coated silica fiber, with 
125-um-fiber diameter and 250-um-outer diameter, a scattering pattern 
with the intensity maxima at +20 degree is generated. A pattern nearly 
4 cm wide projected onto the viewing screen is intersected by approx- 
imately 60 horizontal scan lines on the cctv. With the algorithm 
previously described, centering is controlled within +1 part in 30, or 
+2/3 degree. From Fig. 7, the sensitivity to fiber offset normal to the 
incident laser beam is 1 percent of the coating radius per degree shift 
in concentricity. Therefore, the control of the average concentricity is 
+1 ym. 

Localized fluctuations may exceed this value. Experience has shown 
that long lengths of fiber may be coated with coating concentricities 
within 1.5 percent of the coating diameter. 


IX. CONCLUSION 


A multilevel model for a lightguide fiber coated with material with 
refractive index greater than that of the fiber, has been used to 
determine characteristics of the forward-scattered pattern with respect 
to the coating concentricity. The results of the model have been used 
as a basis for a system which detects and controls the average position 
of the fiber in the coating within 2 pm. Experimentally, the fiber- 
coating concentricity has been maintained within 1.5 percent over 
multikilometer lengths of fiber. With this control, the predicted limi- 
tation in detectable eccentricity of 7.5 percent of coating diameter is 
not a significant factor. 
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A system whereby speech is used as a data carrier is proposed. The 
speech, sampled at 8 kHz, is divided into blocks of N samples, and 
provided the correlation coefficient and mean square value of the 
samples exceed system thresholds, data is allowed to be transmitted. 
If the data is a logical 0, the samples are sent without modification; 
however, if a logical 1 is present, frequency inversion scrambling of 
the samples occurs. The receiver performs the inverse process to 
recover both the speech and data. Data rates of 700 b/s were achieved 
without data errors or speech distortion via an ideal channel. The 
effects of additive background and channel noise were investigated, 
and the system was shown to operate at 126 b/s with no data errors 
when the additive noise was as high as 10 dB below the mean square 
value of the speech signal. 


|. INTRODUCTION 


There are numerous schemes’” for analog scrambling of speech 
signals, but they all require a scrambling key. For example, we may 
sample the speech at a rate in excess of its Nyquist rate, parcel the 
samples into blocks, and rearrange the blocks prior to transmission. 
This rearrangement of the blocks breaks up the rhythm in the speech 
making it difficult for an eavesdropper to comprehend the conversa- 
tion. The shuffling of the block positions is done under the auspices of 
the scrambling-key, and provided the receiver knows this key and, 
hence, the descrambling key, the blocks of speech can be correctly re- 
positioned and made intelligible to the desired recipient. 

It is not our purpose to describe the numerous scrambling tech- 
niques, but rather to suggest a method whereby speech and data can 
be transmitted simultaneously over the channel by using scrambling 
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strategies. The principle is very simple. The scrambling key becomes 
the data to be transmitted. The receiver adopts the role of code- 
breaker. Every time the receiver correctly guesses the key and breaks 
the code, it recovers both the speech and the data. For the scheme to 
have any significance, the receiver must break the code successfully at 
nearly every attempt. Therefore, we must select scrambling keys which 
are easy to break, and this means that we are not aiming for speech 
privacy (although a degree of privacy may be achieved as a by- 
product). The scrambling process is, therefore, a catalyst which enables 
the data to be transmitted. 

At first sight, it might appear that we are getting something for 
nothing. With care we can arrange for the data to be transmitted at 
negligible error rate, the speech faithfully recovered, and a small 
bandwidth expansion of the transmitted signal compared to the origi- 
nal speech. These rewards are derived from the inherent redundancy 
in the speech signal. Indeed, we emphasize that the method will work 
with any signal that has correlative features, such as speech, television, 
facsimile, and analog-plant control signals, like pressure and temper- 
ature variations, etc. 


Il. SIMULTANEOUS SPEECH AND DATA TRANSMISSION USING 

FREQUENCY INVERSION SCRAMBLING 

As a demonstration of the concept, we describe the transmission of 
data using the simplest of scrambling methods, frequency inversion. 
In this method, speech, band-limited to 3.4 kHz, is sampled at 8 kHz 
and N samples are processed at a time. Let us represent these samples 
as 

Si = Xo, X1, X2, +++, XN-1- (1) 


To invert the frequency components associated with these N samples, 
all we need to do is to alter the polarity of every other sample,** 
namely, 


So = Xo, —X1, X2, —X3, + °* » ~XN-1 (2) 


N even. 

In frequency-inversion scrambling (FIS), sequence S2 would always 
be transmitted, but in our scheme, sequence S2 is used when we decide 
to transmit data and, further, the data is a logical 1. Observe only one 
bit per N speech samples is transmitted. 

To minimize the number of bits received in error, we proceed as 
follows. The calculation 

N-2 


YD XiXis1 
i=0 


p = Wri 
y xi 
a 
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is made on the original speech sequence S; and called here the 
correlation coefficient, and the mean square value 


1 N-1 


2 2 
Ox N > Xi (4) 


in the block of speech samples is also found. Notice that the correlation 
coefficient p; of the scrambled sequence S2 is —p. Figure 1 shows the 
block diagram of the system. Mean square value o2 and correlation 
coefficient p are compared with system threshold parameters T and K 
in comparators COMP 1 and COMP 2, respectively. Parameters T' and 
K may be selected such that o2 > T and p > K generally implies the 
absence of unvoiced speech and silence, assuming there is no additive 
background noise. This strategy aids in reducing the number of re- 
ceived bit errors when transmitting through noisy channels. Later we 
will give details of how T and K are selected. 
Data is only transmitted when the Boolean equation 


y= CiC2 (5) 
is a logical 1, where 
logical 1 if 02 = T 









Ci = 3 : 
logical 0 if 02 < T 
and 
CO = logical lifp=K, 
2— . 5 
logical 0 ifp << K 
INVERT So COMBINED 
ALTERNATE O SPEECH/DATA 
i SAMPLES a SIGNAL 
O 
O 
INPUT 
SPEECH 
SAMPLES 
O 


DATA FOR 
TRANSMISSION 
O 


Fig. 1—Block diagram of the sspT/FIs system at the transmitting end for the 
simultaneous transmission of speech and data. 
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The data sequence is allowed to select S, or S: if eq. (5) is satisfied. 
Thus, if y = 1, the switch in Fig. 1 is set in position A or B if the data 
is logical 0 or 1, respectively, i.e., a sequence S7 is generated according 
to 


_ | Si, data = logical 0. 
Sh de a data’ = losieal 1 (6) 


Whenever y = 0, Sr = Si, the unscrambled speech. The sequence Sr is 
appropriately filtered and transmitted as the combined speech and 
data signal. 

To illustrate the effect of the imposition of data on the speech signal, 
we show the waveforms in Fig. 2. In (a) and (b) of Figure 2 an arbitrary 
segment of speech and the corresponding transmitted signal containing 
data for 120 blocks are shown, respectively. The envelope of the signal 
is barely changed, and blocks conveying zeros are not scrambled. 
Hence, the transmitted signal is perceived as a distorted version of the 
input speech—intelligible but tiresome to a listener. A smaller segment 
of the original speech signal, and the resulting transmitted signal for 
the logical values of the data shown, are displayed, respectively, in (c) 
and (d) of Fig. 2. There are now only 24 blocks, and the frequency 
inversions are apparent when the data is a logical 1. 





(a) (b) 


(c) (d) 


Fig. 2—Arbitrary segments of speech are shown in (a) and (c), and the corresponding 
transmitted signals are displayed in (b) and (d), respectively, N = 8, T = 0, K = 0.6. The 
logical values of the data signal are shown for the transmitted signal (d). 
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The signal emerging from the transmission channel is sampled at 8 
kHz to give S;, where a caret (~) above the symbol signifies its 
presence at the receiver. In the absence of channel impairments 
Sr = Sr, the power 62 and correlation coefficient 6 of the sequence S7 
in the block of N samples is computed according to eqs. (3) and (4). 
The operations associated with eq. (5) are implemented, and the 
following processes are performed until a decision is reached. 

(t) If ¥ is a logical 1, data is assumed to be transmitted of value 
logical 0, and Sr = S, is the recovered speech sequence. 

(it) If y is a logical 0, data may or may not be present. To determine 
whether data is present, every other sample in Sz is inverted and the 
scrambled correlation coefficient p; is computed. Then, 

(a) if ¥ remains a logical 0, it is decided that no data was sent. 
The recovered speech sequence is, therefore, the original received 
sequence Sr. 

(b) if y becomes a logical 1, it is decided that data is present of 

value logical 1, and the recovered speech sequence is the scrambled Sr 
sequence. 
Observe that if the conditions are not correct for the conveyance of 
data, or if a logical 0 is transmitted, the speech is dispatched without 
being scrambled. Only when a logical 1 is transmitted is scrambling 
performed, and this is done twice, once at the transmitter and once at 
the receiver. Should a data error occur, the speech at the output of the 
receiver may be erroneously scrambled. The resulting error samples in 
the block of length N have a rate of 4 kHz, and magnitudes double 
that of the original speech samples. 


lll, PERFORMANCE PARAMETERS FOR DATA TRANSMISSION 


From a data transmission point of view we are interested in the 
transmitted bit rate (TBR) and the total bit error rate (TBER). Data 
will only be transmitted when y of eq. (5) is a logical 1, and the 
efficiency 7 of the system to transmit data is given by 


actual data rate 


a possible data rate (7) 
from which 


_ 1fs 
TBR =, (8) 





where f, is the sampling rate of the speech signal. Error bits are those 
bits generated incorrectly at the receiver, and the number of bit errors 
per second is the TBER. Let the measure of the deficiency of the system 
that results in erroneous data at the output of the receiver be known 
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as the data transmission deficiency, 
___ data error rate 
possible data rate 


(9) 
Then 


TBER = 


\fs 
W (10) 


or 


TBER = BER + FBR, (11) 


where BER is the conventional bit-error rate that relates to those bits 
transmitted that were erroneously received. The term FBR is the false- 
bit rate that is associated with the generation of bits at the receiver 
when none were actually transmitted, and the declaration at the 
receiver that no bits were transmitted when they really were. Repre- 
senting the states when the transmitter does not transmit data, and 
when the receiver deems that no data was transmitted, by the symbol 
—1, and using the logical data symbols of 1 and 0, we are able to 
construct Table I, which shows all the possible data-error conditions. 

Let us consider the case of no additive noise to the speech input 
signal, and an ideal channel. In this case, the false bits are always a 
logical 1 and occur when no data (—1) was transmitted. This is state A 
in Table I. These errors occur when the power in the block is above 
the threshold, o2 = T, and the correlation p is below its threshold, 
p < K, prohibiting transmission of data. Now K is a positive number, 
and the bit error will occur if p is negative having a magnitude Ki, say, 
that is greater than K. At the receiver, 8; = S,, 9 = logical 0 and, 
hence, the received sequence is scrambled. Because the correlation 


Table |I—Data error table and output 
speech status 


Recovered 
Data Status Speech 
———— ‘Status 
State TX RX at RX 
A -1 1 I 
B -1 0 C 
C 1 0 I 
D 1 ~1 I 
E 0 1 I 
F 0 -1 Cc 


Note: Logical states of the data are rep- 

resented by 1 and 0. When no data is sent, 

or no data received, —1 is used. When the 

output speech at the receiver for the block 

of N samples is correct, the symbol C is 

used; when it is scrambled, i.e., frequency 
inverted, J is used. 
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coefficient of the scrambled sequence isp, = —p = +Ki, and Ki > K, 
y is now a logical 1, and data is deemed to be present having a value 
logical 1. Thus, the probability of a false bit being generated is very 
low, being the joint probability that o2 = T and p < —K. 

When the speech signal is in a noisy environment, the symbols x,.) 
representing speech in eqs. (1) to (4) are replaced by x(.) = x.) + m.), 
where n,.) is the noise component and the ’ above the symbols means 
noise contamination. The effect of the noise is to increase o% and 
decrease p’, and as both o%: and p’ must exceed their thresholds [see 
eq. (5)] for data to be transmitted, the TBR decreases. Provided the 
channel is ideal, the TBER will depend on the correlative properties of 
the received speech, and the only source of errors derives from state 
A, 1.e., TBER = FBR. 

When clean speech is used and the channel is noisy, the TBR is 
unaffected. However, the TBER increases with channel noise power 
because the noise decorrelates the received signal, causing the receiver 
to sometimes erroneously presume that no data was transmitted. Thus, 
states D and F apply for this condition, and as the existence of other 
states occurs with a much lower probability, the received bit rate is 
approximately the difference between TBR and FBR. 

Dispersive channels alter both the power and correlation of the 
recovered signal. The most common state is D which occurs when 
p < |K|. State C occurs when the scrambled speech arrives with a 
correlation p = K, causing 1 to be interpreted as a 0. State F occurs 
when p <|K|, or oz < T, or when both 6 <|K| anda: < T. The other 
states were found to rarely happen. 


IV. DATA TRANSMISSION PERFORMANCE 


The simultaneous speech and data transmission using frequency 
inversion scrambling, sspT/FIs, described here, was investigated using 
the sentences: “Live wires should be kept covered,” and “To reach the 
end he needs much courage”—spoken by a male and female, respec- 
tively. The speech signal was sampled at 8 kHz to yield 38,912 samples, 
a number sufficiently large to give a good indication of the system’s 
performance. The amplitude of the speech samples was confined to 
the range extending from —6000 to +6000 arbitrary units, and the 
mean square value of the samples averaged over both sentences was 
MS, = 1.09 x 10° or 60.4 dB relative to a mean square value of unity. 
The time waveforms for these two sentences and an expanded version 
of the magnitude of these speech samples to give the time variation of 
the low-level sounds are shown in Fig. 3. 

Our objectives were to determine how to select K, T, and N for high 
TBR and low or negligible TBER, and to study how the performance 
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Fig. 3—Time waveforms for “‘Live wires should be kept covered,” and, “To reach the 
end he needs much courage,” are shown in (a) and (c). The corresponding positive 
amplitudes of the waveforms for the low-level sounds (high amplitudes truncated), 
together with various values of VT, are displayed in (b) and (d), respectively. 


deteriorated in the presence of additive noise on the input speech and 
on the transmitted SSDT/FIS signal. We assumed that block synchro- 
nization between transmitter and receiver was correctly maintained at 
all times. 
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4.1 Selection of K 

The two sentences were processed sequentially. The speech samples 
were divided into blocks of N samples, where N could be either 8, 16, 
32, 64, 128, or 256, resulting in 4864, 2432, 1216, 608, 304, and 152 blocks 
of samples, respectively. For each value of N the probability density 
function (PDF) was computed for the correlation coefficient p, and 
plotted in Fig. 4. The ppFs were found to have similar shapes for N = 
16 to 256, although the shape marginally altered for N = 8. For smaller 
values of N, there is a translation in the position of the PpF to lower 
values of p. This arises because of the definition of p given by eq. (3). 
The maximum possible value of p for N = 4, 3, and 2 is 0.809, 0.707, 
and 0.5, respectively. We will subsequently show that N = 4 is the 
smallest block size of interest in this transmission system; therefore, 
we do not display pPpFs in Fig. 4 for N < 4. 

In the sspT/FIS system, with the threshold T set to zero, the signal 
used to transmit data is the original speech signal, for which the curves 
in Fig. 4 apply. However, if T > 0, more blocks of speech are rejected 
for the conveyance of data. Therefore, TBR decreases, and the resulting 
blocks available for data transmission have PDFs for the correlation 
coefficient that are different from those in Fig. 4. At this stage, we will 
confine the discussion to the case of T' = 0, i.e., where TBR has its 
highest values, and the curves in Fig. 4 are relevant. 


ESTIMATED PROBABILTIY DENSITY 





—0.2 0 
CORRELATION COEFFICIENT, p 


Fig. 4—Probability density function for the correlation coefficient p for different 
values of N. 
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Returning to these curves, we draw attention to their most negative 
correlation coefficient, pmin, values, as they can have a significant effect 
on the number of bit errors. The variation of pmin, and the maximum 
correlation coefficient pmax, aS a function of N is displayed in Fig. 5. We 
recall from our discussion in Section III, that if p < K, 02> T, no data 
is transmitted. Assuming an ideal channel, and given thato = p < — K, 
the system is fooled into believing a logical 1 was transmitted and a bit 
error occurs. Clearly, if K is selected such that p < — K does not exist, 
then no bit errors are possible over an ideal channel. To avoid bit 
errors we arrange for 


K> | Pmin|; (12) 


and the choice of K to avoid bit errors as a function of N must, 
therefore, be below the curve |pmin|, e.g., for N = 64, K > 0.43. For 
N < 16, pmin, and pmax both decrease with decreasing N, and for N = 4 
we have the interesting situation that |pmin| = Pmax, Which means that 
if Inequality eq. (12) is satisfied no data will be transmitted as p > K 
cannot exist. The value, N = 4, therefore, marks the lower limit of the 
block size for combined speech and data transmission over an ideal 
channel without the occurrence of bit errors. 

Reducing K from pmax increases the number of speech blocks that 
can be considered for the conveyance of binary data, but if K =| pmin|, 
bit errors ensue. Thus, in order to transmit the greatest amount of 
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Fig. 5—Variation of maximum (pmax) and minimum (pmin) correlation coefficient 
values for different block sizes (N). 
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Fig. 6—Variation of data transmission efficiency (y) and data transmission deficiency 
(A), as a function of K and T for N = 8. 


data without errors over an ideal channel, K is bounded by 
| Omin| < K < Pmax - (13) 


However, the negative tails of the ppFs are long and of low amplitude 
and, hence, K < |pmin| can be used provided the penalty of a low TBER 
can be tolerated. 


4.2 Ideal channel 


The effects of parameters K and T on the data transmission effi- 
ciency y, the TBR, the data transmission deficiency A, and the total 
TBER, for block sizes of 8 and 32, is shown in Figs. 6 and 7, respectively. 
These two block sizes were selected because N = 8 provides the highest 
data transmission rate in the absence of false bits, and N = 32 has 
fewer false bits than does N = 8 at low values of K, while having a 
relatively high TBR. Because the shape of the curves in Figs. 6 and 7 
are similar, we refrain from showing curves for other values of N. 

The curve for T' = 0 is of interest as it provides the highest values of 


SPEECH AND DATA TRANSMISSION 2091 


220 


200 


180 


TRANSMITTED BIT RATE, TBR, BITS PER SECOND 
DATA TRANSMISSION EFFICIENCY, 7), PERCENT 
DATA TRANSMISSION DEFICIENCY, A, PERCENT 

o 
TOTAL BIT ERROR RATE, TBER, BITS PER SECOND 


: T=0 
i OT = 50 
T=0TO100 b T= 100 4 
O 
\ oT = 103 
140 r= 10° 


\ 
\ 


© 7 = 104 


O - 
T=4.5x 104 
T=2x 10° 


120 





0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 


Fig. 7—Variation of data transmission efficiency (y) and data transmission deficiency 
(A), as a function of K and T for N = 32. 


n, and relates to the discussion in Section 4.1. If & is increased beyond 
0.8 the curve falls rapidly, becoming zero for K = pmax. As K is reduced 
below 0.2, n climbs towards 100 percent, but A becomes excessive. 
Consider the operating condition: T = 0 and K = |pmin|. For N = 8, 
K = 0.6, 7 = 71 percent, \ = 0, yielding a TBR of 710 b/s and a TBER of 
zero. By reducing K to 0.2 while maintaining T = 0, 7 is increased to 
90 percent, or to a TBR of 898 b/s. However, A is now 3.35 percent, 
giving a TBER = FBR of 33.5 b/s. These FBRs arise because K is below 
| Pmin]. The system can operate with low values of K, 0.2 say, provided 
T is increased. By raising the value of T, blocks which occur during 
silence and unvoiced periods are not considered for data transmission. 
For still higher values of T, blocks existing during silence, unvoiced, 
and low amplitude voiced sounds, are rejected for the conveyance of 
data. The values of T used in our experiments (other than T = 0), 
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namely 50, 100, 10°, 10%, 4.5 x 10*, 2 x 10°; correspond to power levels 
that are 43.4, 40.4, 30.4, 20.3, 13.8, and 7.36 dB, respectively, below the 
mean square value MS, of the combined speech signals. The VT 
thresholds, except 50 , are shown in Fig. 3(b) and (d), and for reference 
(2 x 10°)!” is shown in Fig. 3(a) and (c). 

The effect of using non-zero values of T is a modification of the 
shape of the ppr of the correlation coefficient p—specifically, the 
truncation of its long negative tail. Consequently, pmin, a negative value 
in Fig. 3, is made significantly more positive. For example, when N = 
8, Pmin = —0.6, —0.56 and —0.07 for T = 0, 10° and 2 x 10°, respectively. 
When T' = 2 X 10°, no false bits occur, irrespective of K, as shown in 
Fig. 6, but 7 decreases significantly to 48 percent, giving a TBR of 480 
b/s. Clearly, for the ideal channel, there is no advantage in making T 
anything other than zero and K = 0.6. We will find that in the presence 
of channel noise 7 must have a high value if \ is to be contained. 

Although increasing N generally produces higher values of 7, as can 
be seen in Fig. 7, the larger block size results in a significant reduction 
in TBR. The effect of N on A is seen to be small; therefore, we 
recommend the use of N = 8. 

The effect of block size N on TBR and TBER for different values of T 
is shown in Fig. 8, where K = 0.5. The data transmission efficiency is 
approximately independent of N for a given T, and consequently TBR 
is inversely proportional to N. [See eq. (8)]. False-bit errors occur for 
N = 16 and 8 as pmin < —0.5, unless T is increased to ~4.5 x 10‘. By 
using this high value of 7, TBR is seen to fall from 530 b/s to 19.5 b/s 
as N is increased from N = 8 to 256, the TBER being maintained at 
zero. Clearly, from a data transmission point of view, high values of N 
are undesirable, although they do increase the listening fatigue of an 
eavesdropper, as described in Section 5.2. 

The small block size of N = 4 that spans a duration of 0.5 ms can be 
used to transmit data without error, provided a large value of T is used 
to remove the long tail in the correlation coefficient ppF of Fig. 4. We 
found that if T = 2 x 10°, A = 0, and n = 31.6 percent. This represents 
a TBR of 632 b/s and a TBER of zero. 


4.2.1 Effect of background noise 


To simulate a noisy environment, we added a random noise sequence 
having a power o7; to the speech sequence. The effect of this back- 
ground noise power on the data transmission efficiency 7 and TBR for 
different values of threshold T is shown in Figs. 9 and 10 for N = 8, 
K = 0.6, and N = 32, K = 0.5, respectively. An additive power level of 
a2: = 10°, k= 0, 1, 2, --- , corresponds to a power level of 60.4-10 k, dB, 
below the mean square value MS, of the speech signal. As expected, 
the highest value of 7 occurs when T = 0, as the only criterion applied 
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TRANSMITTED BIT RATE, TBR, BITS PER SECOND 
TOTAL BIT ERROR RATE, TBER, BITS PER SECOND 





‘\ /\ 
8 16 32 64 128 256 
BLOCK SIZE, N 


Fig. 8—Variation of data transmission efficiency (yj) and data transmission deficiency 
(A), as a function of block size (N) for different values of T, K = 0.5. 


in the selection of blocks to convey data is based on whether the 
correlation coefficient p is above K. Increasing o7; causes the speech to 
be decorrelated; therefore, less blocks have p > K, and consequently 
n decreases. Increasing T means that fewer blocks fulfill the condition 
that y of eq. (5) is a logical 1. The blocks discarded are generally those 
containing low-level speech, and it is these blocks that experience 
greatest decorrelation. Thus, as 0%; increases, 7 remains constant as 
the decorrelative effect is masked by the value of T. When o3; ap- 
proaches 7, blocks not rejected because their mean value is >T are 
now abandoned because of p being too small due to the decorrelation. 
Consequently, 7 versus 07: is no longer a constant, and 7 coalesces with 
the T = 0 curve as o3; is further increased. This occurs because the 
controlling factor in block rejection is now the correlation criterion. 
For 03: > T, n decreases at approximately 6.8 percent per decade 


2094 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


GNOOAS 3d SLId ‘YAEL ‘ALVY YOUR LID TVLOL 
N = ° 





«< 
| 
! 
! 
lo 
uw 
a 
i 





AN3OY 3d ‘Y ‘AON3IDISIG NOISSINSNVYL VLvG 


N=8. 


0.6, 


;) on data transmission efficiency 


2 
On; 
2 
ni 


' 


ADDITIVE NOISE POWER 


Sn oe 


a ee eer rt 


Fig. 9—Effect of additive background noise power (o 
(4) and data transmission deficiency (A), as a function of threshold T. K 


LN39u3d 'W‘ADN31D1443 NOISSINSNVYL VLVG 


[=] oO [=] o [=] io] 
Oo uw j=) 1) {=} wD 
o w wo vt + oO 


GNOODAS ¥3d SLI ‘HSL ‘JLWY LIS GALLIWSNVYL 


SPEECH AND DATA TRANSMISSION 2095 


DATA TRANSMISSION EFFICIENCY,7), PERCENT 
TRANSMITTED BIT RATE, TBR, BITS PER SECOND 





ADDITIVE NOISE POWER, 0,3 


Fig. 10—Effect of additive background noise power (o7:) on data transmission effi- 
aed (yn) and data transmission deficiency (A), as a function of threshold T. K = 0.5, 
= 32,A=0. 


increase in 07; when N is 32, and at a rate approaching this value when 
N is 8. 

The data transmission deficiency and the TBER was found to be 
zero for the case of N = 32, over the range of power levels shown in 
Fig. 10. However, when the block size was 8, data errors occurred. The 
variation of \ and TBER with o3; for different values of T is shown in 
Fig. 9. The figure demonstrates that data errors can be avoided for 
oni < 10* by setting T to 2 x 10°, although the data transmission 
efficiency falls to 43.5 percent, i.e., TBR = 435 b/s. 


4.3 Noisy channel 


When no noise was added to the speech signal, but the channel was 
noisy with additive channel noise power o7c, 7 was uneffected. How- 
ever, A increased due to blocks that did not contain data (o2 < T) but 
had their power increased to o2 + o3- = T, and if 6 or ps exceeded K, 
data errors ensued. The variation of \ and TBER with o3, for various 
values of T is displayed in Figs. 11 and 12, for N = 8 and 32, 
respectively. As A and TBER had zero values for large values of T when 
N = 32, we present a zero line in Fig. 12, and the lines from this base 
to the other points on the curve are dotted. Notice in Fig. 12 that no 
data errors were recorded over the entire range ofo%- when T' = 2 X 
10°, and from Figure 10 this value of T corresponded to y = 50.5 
percent. Thus, by using N = 32, and a background noise power and 
additive channel noise power up to 10”, i.e., up to 10.4 dB below MS,, 
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we found that 126 b/s can be transmitted without error. When N = 8, 
T = 2x 10°, and both types of noise are present up to 10°, TBR = 435 
b/s, but TBER is no longer zero, having a value between 0.2 to 3.5 b/s. 
The various combinations of TBR and TBER can be deduced from Figs. 
9 through 12. 


V. SPEECH TRANSMISSION PERFORMANCE 


Emphasis has been given to data transmission because we wanted 
to investigate if it could be reliably achieved using speech as a carrier 
signal. In the previous section, we presented results showing that it 
was possible to transmit data without transmission errors, and conse- 
quently the recovered speech signal was unimpaired by conveying the 
data. However, we have also observed that the data rate can be 
increased if bit errors can be tolerated. Thus, we now address the 
problem of how the bit errors affect the recovered speech signal, and 
specifically ask, How serious is the degradation of speech quality and 
intelligibility when the total TBER approaches the maximum values 
found in our experiments? 


DATA TRANSMISSION DEFICIENCY,A , PERCENT 
TOTAL BIT ERROR RATE, TBER, BITS PER SECOND 





1 10 102 103 
ADDITIVE CHANNEL NOISE POWER O;2, 


Fig. 11—Effect of additive channel noise power (¢2-) on data transmission efficiency 
(n) and data transmission deficiency (A), as a function threshold T. K = 0.6, N = 8. 
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Fig. 12—Effect of additive channel noise power (o3-) on data transmission efficiency 
(n) and data transmission deficiency (A), as a function of threshold 7. K = 0.5, N = 32, 
n = 63.7. 


5.1 Objective measure 


To provide an objective measure of the degradation of the recovered 
speech signal that accrues solely from the effect of data errors and not 
from the presence of additive background or channel noise, we elected 
to use segmental s/n (SEG-S/N). This ratio is used”® as an objective 
measure because its value corresponds more closely to perceived 
quality than those of conventional s/n measurements, i.e., those using 
the ratio of mean square signal power to mean square noise power, 
determined over the duration of the entire signal. The reason for this 
resides in the computation of SEG-S/N, which is performed as follows. 
The input speech sequence {x;} is divided into contiguous blocks of 
128 samples, i.e., into periods of 16 ms. Only those blocks, for example, 
M, whose rms value exceeds I’ dB (here —60 dB) relative to the peak 
value, have their s/n calculated. The s/n for the jth block is 


128 


2 
> X 1287 +i 
i=] 


s/n; = 10 logio) 73 


> (x1287+i — L128 j+i)? 
i=1 


J=1,2,---,M, (14) 
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where {Z,} is the recovered speech sequence. There is no point in 
considering s/n; < —10 dB, or > +80 dB, because the speech quality is 
not perceived worse or better than —10 dB or +80 dB, respectively. 
Hence, 


s/n; = —10 dB for s/n; = —10 dB 
and 
s/n; = +80 dB for s/n; = +80 dB (15) 


for 7 = 1, 2, --- ,M. The number of data blocks of length N contained 
in the s/n, calculation block size of 128, decreases from 16 to 0.5 as N 
is increased from 8 to 256, respectively. For a given TBER, the effect of 
increasing N is to cause the number of blocks not having the maximum 
s/n; of 80 dB to decrease, but the decrease in s/n; is more substantial 
in those blocks of 128 samples where erroneous scrambling occurred 
with all the samples, compared to those blocks where only,8 samples 
were erroneously scrambled. 

The segmental s/n is computed as the average of s/n;; 7 = 1, 
2,---,M, namely, 


1 M 
m2 s/n;. (16) 
The I threshold enables us to ignore blocks of low-level speech in the 
calculation of SEG-s/N. Even if these blocks are erroneously scrambled 
at the receiver output, their removal from the calculation is justified 
on the basis that the effect on such a low-level sound is imperceptible. 
Let us now consider the case of speech conveying data through an 
ideal channel without the introduction of data errors. Here SEG-S/N is 
80 dB, as s/n,, for all 7, is forced to 80 dB. When data errors occur, the 
SEG-S/N does not fall greatly below 80 dB, implying that the degrada- 
tion in speech quality because of the presence of data errors is small. 
Perceptual observations substantiate this implication, confirming our 
decision to use SEG-S/N as an objective measurement of performance. 
When the input speech is contaminated by statistically independent 
background noise, the recovered speech at the receiver is the sum of 
the original speech and noise signals, provided there are no data errors. 
Whenever a data error occurs, both the speech and the noise in the 
block are scrambled. In the case of additive channel noise, and for no 
data errors, the recovered speech is, again, the sum of the original 
speech and the noise signals, except when the data is a logical 1 when 
the output noise signal is scrambled. The effect of data errors, exclud- 
ing states B and F, is to cause the output signal to be the sum of the 
scrambled original speech signal and the scrambled noise signal. Per- 
ceptually, the scrambled noise signal is the same as the original random . 


SEG-S/N = 
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noise signal; therefore, its effect must be removed in evaluating the 
loss of SEG-S/N because of data errors. This is achieved by noting those 
blocks where data errors occurred in a noisy environment, and then 
scrambling the original speech in those same blocks to give the 
sequence {X;} used in eq. (14). By this method, we are able to separate 
the distortion in the output signal, caused by the additive noise, from 
the effect of the data errors that were precipitated by this noise. 


5.1.1 Objective results 


The occurrence of a bit error results in a block of speech samples at 
the output of the receiver being erroneously scrambled. Clearly, if this 
happens to a block containing high amplitude samples, significant 
distortion ensues. From Section IV we have seen that the reduction or 
elimination of data errors can be achieved by increasing T. However, 
this is not our purpose here. We wish to generate data errors and 
observe their effect on the recovered speech. Consequently, in selecting 
conditions to illustrate the reduction in SEG-S/N caused by data errors, 
we have opted for T = 0. Table II shows the SEG-s/Ns for some of the 
worst data-error conditions shown in Figs. 5 to 12, plus a high data- 
error case when N = 256. These conditions were selected to show that 
in spite of the TBER values being unacceptably high for most data 
communications systems, the effect of the data errors on speech quality 
is small. Indeed, SEG-S/N remains above 66 dB for all the conditions 
depicted in Table II. Therefore, we refrain from presenting detailed 
measurements of speech distortion that is barely perceptible. However, 
we do discuss the error conditions, but cannot make general deductions 
from the few entries in the table, particularly as there is not a consistent 
theme. For example, for the noisy channel there are three values of N, 
but they each have a different K, so comparisons must be tempered 
with caution. 

In Table I we have included the recovered speech status at the 
receiver for each of the data-error states. Two states, B and F, do not 
cause the recovered speech signal to have the samples in the erroneous 
blocks scrambled. When additive channel noise is present, error states 
D and F occur more frequently than the other states. Thus, the 
distortion in the output speech results mainly from state D, i.e., when 
a transmitted logical 1 is ignored. As states D and F are likely to occur 
with approximately the same probability (see Table II), the error rate 
in terms of distorting the recovered speech, can be considered to be 
reduced by a factor of two. However, state D is associated with a loss 
of a data signal caused by the channel noise increasing the correlation 
of a block of speech samples. Because data was transmitted, the speech 
can be voiced (although 7' = 0 for Table II) in which case the speech 
distortion may be substantial. 
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Table II—SEG-SN for some high data-error conditions, T = 0 


Data Data 
Transmis- Transmis- 
Corr. Additive Occurrence of Data-Error States sion Defi- sion Effi- 
Block Thresh- Noise oo =_ Clency A, TBER ciency 7, TBR SEG-S/N 
Condition SizeN old K Power o2 A B C D E F % b/s % b/s dB 
Ideal channel, no ad- 8 0.2 ZERO 154 0 Oo 0 0 0 3.17 31.7 89.7 897 72.2 
ditive noise 
Noisy channel 8 0.6 918 6 32 1 3802 1 = 286 12.8 128 80.0 800 66.7 
Ideal channel, additive 8 0.6 9.18 x 104 34 0 O 0 O 0 0.47 4.7 40.0 400 76.1 
background noise 
Ideal channel, no ad- 32 0.2 ZERO 36 0 O 0 O 0 2.96 7.4 91.3 228 74.5 
ditive noise 
Noisy channel 32 0.5 918 0 0 0 99 0 85 15.5 38.8 84.5 211 75.6 
Noisy channel 256 0.2 4.3 x 10° 1 0 0 10 O 10 14.6 4.56 93.4 29 77.4 


The data errors resulting from the ideal channel, with or without 
additive background noise, cause state A to apply, as shown by the - 
examples in Table II. Although the output speech blocks are erro- 
neously scrambled every time a data error occurs, the distortion is 
confined to blocks that often contain unvoiced sounds as the data error 
is the result of no data being transmitted, but a logical 1 being falsely 
generated. 

Waveforms for the worst condition shown in Table II, namely, 
additive channel noise, N = 8, T = 0, K = 0.6,02. = 918, are displayed 
in Fig. 13. The high amplitude signal levels are seen to be substantially 
unaffected by the data errors whose effects are often immersed in the 
channel noise, and are therefore not perceptibly annoying. 


5.2 Informal listening experiences 


Informal listening tests were performed for the conditions listed in 
Table II. The recovered two sentences of speech, stripped of noise, 
with blocks of speech erroneously scrambled where data errors oc- 
curred, suffered only minor distortions. For the ideal channel, minor 
distortions resembling a “sshing” sound, occurred on three occasions 
for the cases of N = 8 and 32. A quiet noise, like additive white noise, 
was perceived for N = 8 when either background, or channel noise 
(the worst condition) were present. For the noisy channel condition, 
N = 32 produced the effect of barely perceptible scratches, while N = 
256 yielded the least distortion, where the degradations were reminis- 
cent of barely audible metallic clicks. 

When the noisy output signal containing the effects of data errors 
was compared to the original speech plus additive noise, the effect of 
the data errors was imperceptible in the case of the substantial additive 
input noise power 07; = 9.18 x 10‘, N being 8. The effect of unwanted 
scrambling when the channel was noisy ranged from barely perceptible, 
N = 256 and 32, to nonannoying crackles when N = 8 ando?. was only 
918. 

The conclusion is that for the data-error rates of practical signifi- 
cance, the degradation in speech quality is insignificant. 

The transmitted signal sounded like distorted speech, plus white 
noise for the case of N = 8, and an ideal channel. The effect of additive 
background or channel noise was to reduce the fatiguing effects, as if 
the distortion had been removed from the speech and the background 
noise increased. When N = 82, the channel ideal, the distortion was 
increased as this block size corresponds to 4 ms, approximately half a 
pitch period. The speech sounded as if speaking and gargling were 
being performed simultaneously. The act of adding noise marginally 
reduced listening fatigue. The scrambled signal was found to be just 
intelligible when N = 256. 
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Fig. 13—Effect of additive channel noise. (a) Original speech. (b) Transmitted signal 
with additive channel noise having 02, = 920. (c) Recovered speech having data errors 
(A = 12.8 percent) and additive noise. N = 8, T = 0, K = 0.6. 


VI. DISCUSSION 


We started by enunciating a principle: that data could be transmitted 
by making it the scrambling key, and casting the receiver in the role 
of code breaker. Every time the receiver guesses the key, it obtains the 
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correct data and the correct speech. The speech is made an unwitting 
data carrier, while the data gets a free ride. The implications of this 
concept are considerable. Continuous users, or providers, of telephone 
traffic can, at the expense of additional terminal equipment, surrepti- 
tiously transmit teleprinter data, with the proviso that the bandwidth 
of the speech signal and the block size are appropriate for the channel 
bandwidth. 

To demonstrate the principle, we wanted a scrambling technique 
that was easy to implement, and where the bandwith of the scrambled 
speech was not larger than that of the original speech. Frequency 
inversion scrambling aptly fulfilled these requisites, where the scram- 
bling is achieved by merely altering the polarity of every other speech 
sample. We have shown that by using this form of scrambling, it is 
possible to transmit speech and data simultaneously, and to receive 
the data without errors and the speech without distortion, even in the 
presence of additive-channel and background noise. Provided some 
data errors can be tolerated, the data rate can be substantially in- 
creased as shown in Figs. 6-12. Even at high data-bit rates the 
distortion in the speech was found to be minimal, as the results in 
Table II indicate. 

We have not presented results for dispersive channels, although we 
did do some experiments. The effect of such channels was to signif- 
icantly alter the ppF of the correlation coefficient of the received signal 
compared to that of the transmitted signal. The power in the blocks of 
speech was also changed by the dispersive properties of the channel. 
As the data detection procedure is based on a measurement of power 
and correlation in a block of N samples, where the correlation is 
usually the most important factor, the dispersive channel results in an 
unacceptably high TBER. Thus, in the presence of dispersive channels, 
equalization of the channel must be performed. 

The speech used in our experiments were two sentences whose 
waveforms are displayed in Fig. 3. Therefore, the results will differ 
when other sentences are used, but not significantly, as the sentences 
used consisted of over thirty-eight thousand samples. The system 
proposed here is for conveying data on speech or short silences. When 
prolonged silences occur, we envisage data being transmitted by con- 
ventional modern techniques. 

The basic principle established, the way forward is to find scrambling 
methods that will be easier to break with certainty, and will operate 
via dispersive channels without the necessity of channel equalization. 
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Frequency Scaling of Speech Signals by 
Transform Techniques 
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(Manuscript received May 15, 1981) 


The general framework of short-time Fourier analysis, modifica- 
tion, and synthesis is used to describe in a unified way several known 
techniques for frequency scaling of speech signals. Subsequently, a 
frequency domain harmonic scaling technique is studied in detail 
with emphasis on improving its performance and its implementation 
efficiency. This technique is particularly attractive for 2:1 scaling by 
use of a sign tracking algorithm which avoids the need for explicit 
phase computation and unwrapping. The implementation efficiency 
is achieved by using the fast Fourier transform algorithm, embedded 
decimation and interpolation, and an extended version of a recently 
developed weighted overlap-add synthesis scheme. The improvement 
in quality is achieved by improved sign tracking and elaborate design 
and selection of the analysis and synthesis prototype filters (data 
windows). Results of computer simulations, for a variety of adverse 
acoustical environment conditions, indicate that the system is highly 
robust but its quality for clean speech is lower than with a time 
domain harmonic scaling technique which uses pitch information. In 
applications which do not permit pitch transmission, a hybrid scheme 
which combines the two techniques is found to yield a better quality 
than either system alone. 


1. INTRODUCTION 


Frequency scaling of speech signals is a useful method for reducing 
the bandwidth requirements in analog and digital speech transmission 
systems.’” In analog systems the frequency compressed signal is 
transmitted at reduced bandwidth. In digital systems the frequency 
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compressed signal is waveform coded to provide reduced bit-rate 
transmission.*’ A general block diagram of such a digital system is 
shown in Fig. 1. In this figure, the schematic spectral representation of 
the input speech signal is shown to consist of a spectral envelope with 
pronounced resonances (formant peaks) and of a fine structure because 
of pitch harmonics in voiced speech. The spectral envelope of the 
compressed signal is a scaled version of the input spectral envelope. 
However, different frequency scaling techniques may result in different 
fine structures. 

Since we do not refer at this point to any specific technique, Fig. 1 
does not show the fine structure of the compressed signal. This suggests 
that frequency scaling techniques can be classified according to the 
way the fine spectral structure is scaled. In particular, one can distin- 
guish between narrow-band techniques, such as the phase vocoder” 
and time-domain harmonic scaling (TDHS)* techniques, which aim at 
separating and scaling the individual pitch harmonics, and wide-band 
techniques, such as the analytic signal rooting (ASR) technique® and 
the more recent constant Q transform (car) method,®” which aim at 
directly scaling the spectral envelope. The much earlier Vobanc and 
CODIMEX systems also fall into the latter category.” 

Another way of classification is to distinguish between time- and 
frequency-domain techniques. To provide useful quality, time-domain 
techniques require pitch tracking, as done in the TDHS technique.* 
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Fig. 1—General block diagram of a digital coding system which applies frequency 
scaling for bit rate reduction. 
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Once the pitch is known, the time-domain operations are simple and 
result in good quality scaled and reconstructed speech.””’ Frequency- 
domain techniques are typically much more complex but do not require 
explicit pitch tracking. However, they usually have a lower quality 
because of errors made in resolving phase ambiguity and from the 
need, in general, to scale both the phase and amplitude signals, as will 
be elaborated later on. For applications in which pitch tracking is not 
desired or not possible, because of adverse acoustical environment 
conditions, the use of an efficient frequency domain technique is of 
much interest. 

In this work we present an efficient implementation of a frequency- 
domain harmonic scaling (FDHS) technique which is based on an 
improved version of the technique presented in Ref. 12. Frequency- 
domain harmonic scaling is a narrow-band technique which aims at 
scaling the individual pitch harmonics. It is particularly attractive for 
2:1 scaling, since in this case a sign tracking algorithm avoids the need 
for explicit phase computation and unwrapping (that is, eliminating 27 
phase ambiguities), which in general is a difficult and error-prone 
task.’® The efficient implementation is based on the recently developed 
weighted overlap-add method for short-time Fourier analysis/synthe- 
sis, which allows block processing using the fast Fourier transform 
(FFT) algorithm.” It is extended here to include analysis and synthesis 
windows which are both longer than the FFT block, or the transform 
size. 

The general framework of the short-time Fourier transform (STFT) 
as developed in several recent works’*"* also provides a unified de- 
scription of other known frequency scaling techniques, and helps to 
relate them to the FDHs technique. This unified description is given in 
the following section, and is followed by a detailed description of the 
FDHS technique. Section IV gives the details of the implementation 
scheme and Section V discusses design considerations and simulation 
results. Section VI presents a hybrid technique which combines TDHS 
and FDHs. This combination is designed for applications in which it is 
feasible to extract the pitch at the transmitter but for which the 
transmission of pitch data is either impossible or is to be avoided. 


Il. A UNIFIED DESCRIPTION OF FREQUENCY SCALING TECHNIQUES 


A general scheme for frequency scaling is presented in this section. 
It is based on viewing the frequency scaling operation as a modification 
of the short-time spectrum of the speech signal. This scheme is then 
-used to describe in a unified way several known frequency scaling 
techniques. Our attention in describing the different techniques will 
be mainly focused on the nature of the spectral modifications used by 
each technique, and not necessarily on the way they are implemented. 
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The relation between the short-time Fourier transform (sTFT) and 
a filter-bank analysis is well established.””” For the convenience of this 
presentation, the filter bank which is used to divide the two-sided 
speech spectrum into sub-bands is assumed to consist of N complex 
bandpass filters. The center frequency of the kth filter is denoted by 
w, and its complex (or analytic) output signal by 2z;,(t). It is also 
assumed that each bandpass filter has a real low-pass filter prototype, 
which means that the complex impulse response h;(t), of the Ath filter, 
is given by 


hi(t) = w,(t)exp(Jwrt), (1) 


where w;(t) is the impulse response of the low-pass prototype of h;(¢). 
Note that in general the prototype filters need not be identical, but we 
assume that the bandpass filters are contiguous and are arranged 
symmetrically about w = 0, so that the filter centered at w = —w, has 
the same prototype filter as the one centered at w = wz. This way the 
summation of the outputs from a pair of corresponding (conjugate) 
complex filters results in a real signal. 

The output signal from the #th complex filter has the general form 


2x(t) = Az(t)exp[ 76,(t)], (2) 


where A;(¢) is the amplitude, or envelope, function and @,(t) is the 
phase function. The phase function can be written as a sum of two 
components 


O:(t) = wrt + ox(t), (3) 


where the meaning of ¢;(t) is elaborated below. By substituting eq. (3) 
into eq. (2), we see that z,(t) can be interpreted as being the result of 
the simultaneous modulation of the amplitude and phase of the com- 
plex carrier signal exp(jwzt) by the amplitude and phase signals A;(¢) 
and ¢,(¢), respectively. The instantaneous frequency of z;(t) is given 
by the phase derivative 6,.(t) = we + o(t), so that b(t) is seen to be 
the deviation of the instantaneous frequency from the center frequency 
w,. Frequency scaling of z,(¢) by a factor g (¢ < 1 for compression and 
q > | for expansion), is achieved if the center frequency w, is scaled or 
shifted to gw, and the bandwidth of z;(t), about w,, is also scaled by q. 
It is well known that the bandwidth of a signal which is characterized 
by simultaneous amplitude and phase modulation of a carrier signal is 
a function of both modulating signals.’” Hence, just scaling the instan- | 
taneous frequency deviation ¢,(t) by a factor g does not result, in 
general, in the exact scaling of the bandwidth of z,(t) by g. The lack 
of adequate analytical models which describe the time variations of 
the amplitude and phase modulating signals—for an input speech 
signal—has resulted in a variety of frequency scaling techniques which 
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use different analysis filter banks and different modifications of the 
modulating signals. In the following, the modifications applied to the 
modulation signals by different frequency scaling techniques are used 
to analyze and compare the different techniques. We first, however, 
show that the modification of A;,(t) and ¢,(t) corresponds to the 
modification of the short-time spectrum of the speech signal. 

Since z;,(¢) is the output signal from a bandpass filter having an 
impulse response h,(¢) it can be expressed as the convolution between 
the input speech signal x(t) and h,(t). From eq. (1) this results in 


zp(t) = X(wp, t)exp(jwxzt), (4) 


where 


X (wz, t) = | x(r)w,(t — r)exp(—JweT)dr. (5) 


Comparing eq. (4) with eq. (2) and using eq. (3), we have 
X(w,, t) = A;(t)exp[ joz(t) ]. (6) 


This shows that the amplitude and phase modulations of the carrier - 
exp(jw,t) are fully described by the composite modulation function 
X (wz, t). Additional understanding of these modulation functions can 
be gained from the studies in Refs. 18 and 19. The expression for X(w», 
t) in eq. (5) shows that X(wz, t) is equal to the value of the strT of x(t) 
at the frequency w = wz, if w,(t) is the window function used to weight 
the input signal.””” It should be emphasized again that in the present 
discussion the different bandpass filters covering the speech band 
have, in general, different prototype filters. Hence, for each bandpass 
filter one can define an stTFT which, if evaluated at the center frequency 
of that filter, gives the corresponding composite modulation function. 
If all bandpass filters have identical prototype low-pass filters, only a 
single sTFT is needed to find the value of X(wz, ¢) for each k, by 
evaluating the sTFT at each center frequency. With this understanding, 
we will refer to X(w,z, t) as the STFT of x(t) at w = wz, even for the 
general case of nonidentical prototype filters. 

Denoting the frequency scaled version of 2,(t) by 2 :(¢) and the 
corresponding modified strT by X,(w:z, t), we have 


Z(t) = Xq(wp, t)exp(jquzt). (7) 


The magnitude and phase components of X,(wz, ¢) are accordingly 
denoted by A,z(t) and ¢ z(t), respectively. As noted above, the modi- 
fication of X(w,, £) needed for exact frequency scaling of speech signals 
is not known, and the bandwidth of the individual sub-band signals is 
usually only partially scaled by any given technique. Hence, to avoid 
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excessive interband aliasing when the partially scaled sub-band signals 
are combined, additional filtering of z,,(t) may be needed. The filtering 
of Zq:(£) can be performed either by bandpass filters having a band- 
width which is g-times the bandwidth of the analysis filters, or equiv- 
alently by low-pass filtering the modified strT by the corresponding 
low-pass prototype filters. Figure 2 shows a general block diagram for 
frequency scaling which is based on modifying the stFrT of the input 
signal as discussed above. The impulse response of the synthesis low- 
pass filters which are used to band-limit the output signals in each 
channel are denoted by we,(t). These scaled-bandwidth synthesis 
filters can generally be obtained from w,(t) by the relation w,,(t) = 
w,(qgt). In the diagram of Fig. 2, only the details of the kth channel are 
given since all the other channels are similar (see solid line). The 
filtered modified strT is denoted in Fig. 2 by X, (wn, t). The output 
scaled speech signal ¥,(t) is given by 


Jolt) = x Zqx(t) = Y Xow, t)exp(jquzt), (8) 


where, as seen in Fig. 2, 2,,(t) is the Ath-channel scaled and filtered 
bandpass signal. The summation in eq. (8) is over the N sub-bands. 

It is clear from the above discussion, and from the block diagram in 
Fig. 2, that the choice of the filter bank and the stFrT modification are 
the key issues for any given technique. While the block diagram in 
Fig. 2 provides a basis for comparing different techniques, the actual 
implementations can differ, either because of historical reasons or the 
availability of more efficient or convenient ways for implementation. 

We turn now to the description of several known frequency scaling 
techniques in terms of the STFT modification used by each technique. 
This will exemplify the above discussion and will provide a proper 
perspective for discussing the FDHs technique and its properties and 
implementation. 


—_—> oat 
k— — X(W¢!) Xql Wx, t) oF, ; 
exp(—j Ox) A ' exp{ /q W ¢t) ~ \ 





ANALYSIS MODIFICATION SYNTHESIS 


Fig. 2—General block diagram of a frequency-scaling system based on short-time 
spectral modification (Ath channel shown in solid line). 
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2.1 Analytic signal rooting 


In the analytic signal rooting technique® the number of bandpass 
filters is chosen to match the formant structure of speech signals so 
that, preferably, no more than one formant is present in each sub- 
band. The approach taken in Ref. 3, as well as in the earlier CODIMEX 
system," is to obtain z,,(t) by raising the analytic signal z(t) to the 
power of q. If g < 1, this corresponds to taking the 1/q root of z,(t) 
which is the origin for the name of this technique. Using the relations 
in eqs. (4) and (7), we find that the strT modification performed by 
this technique is 


Xq(wr, t) = LX (wz, t)]?. (9) 


In terms of the modulating amplitude and phase signals, this modifi- 
cation corresponds to 


Agr(t) = [Ax(t)]?, (10a) 
and 
dgk(t) = Qox(t). (10b) 


To understand the effect of this modification, we note that since 
each sub-band is to contain no more than one formant, it can be 
expected that most often one-pitch harmonic, the one closest to the 
peak of the formant, is dominant to the other harmonics in that sub- 
band. From the analysis in Refs. 20 and 21 one can conclude that the 
phase-scaling operation in eq. (10b) scales the instantaneous frequency 
of the dominant harmonic in each band by gq, but the other lower 
amplitude harmonics are shifted in such a way that their spacing from 
the dominant harmonic remains unchanged. The result of this trans- 
lation is that the fine structure spectral components are not necessarily 
harmonic, although their spacing is equal to the pitch frequency. The 
scaling of the amplitude signals in the way given by eq. (10a) can be 
shown to scale the magnitude of the nondominant components (har- 
monics) relative to the amplitude of the dominant component. It also 
affects the intermodulation terms generated by the phase scaling. For 
q <1, the effect is to reduce the magnitude of the nondominant 
components relative to the dominant one and hence, effectively, to 
reduce the bandwidth of the frequency-scaled formants. To avoid 
excessive interband aliasing, it is particularly important in this tech- 
nique to use the band-limiting low-pass filters w,,(t) following the 
modification. 

For more effective scaling of the amplitude signals, and with respect 
to the cat which uses constant-Q bandpass filters, consider the ap- 
proach by Ravindra.’ He suggests that the A;(t) be spectrally analyzed 
for each k& by an additional bank of filters and that the bandwidth be 
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scaled by scaling the phase in each sub-band—this can be repeated in 
a tree-like structure.’ The implementation complexity of this approach, 
however, appears to be exorbitant. 


2.2 Phase vocoder 


The phase vocoder, as its name indicates, can be used directly as a 
vocoder system in which the phase derivative and magnitude of the 
input signal sTFT are coded and transmitted.’®””** The phase vocoder 
technique can also be applied for frequency scaling and this aspect is 
considered here.” 

In the phase vocoder, the number of bandpass filters is chosen to 
match the harmonic structure of voiced speech.” This means that a 
relatively large number of filters is used so that, preferably, no more 
than one-pitch harmonic is present in each sub-band. The fact that 
individual harmonics are separately scaled, allows us to infer the 
characteristics of the modulation signals in each band from known 
speech properties. In particular, since pitch and vocal tract variations 
are relatively slow, the bandwidth of each pitch harmonic is quite 
narrow, as shown by the “pitch teeth” in the input spectrum shown in 
Fig. 1. In view of this fact, one would expect that even if the pitch 
harmonics are only shifted to the proper frequencies, without scaling 
the bandwidth of each pitch tooth, acceptable compression can be 
achieved (i.e. only a small interharmonic aliasing is expected), provided 
that the compression ratio is limited to 2 or at most 3. Indeed, this 
finds support in the results obtained with the TpHs technique which 
we discuss later.*” However, to shift the pitch harmonics to the proper 
locations requires knowledge of the pitch frequency or, equivalently, 
the deviation of each pitch harmonic from the center frequency of the 
sub-band in which it is located. 

Let 2, be the pitch-harmonic frequency in the th sub-band, with 
center frequency w,, and AQ, the deviation of the pitch harmonic from 
the center frequency; i.e., AQ, = 22 — wr. Then, the phase derivative 
¢x(t) can be expressed as 


dx(t) = AQ + ¥(8), (11) 


where ¥;(t) describes the contribution of the phase variations to the 
bandwidth of the pitch harmonic in the Ath sub-band. In the phase 
vocoder technique, the phase derivative };(t) is scaled by qg, so that in 
addition to shifting each pitch tooth to its proper location, a partial 
scaling of its bandwidth is obtained since Y,(t) is scaled as well. The 
amplitude modulation signals are not modified in this technique. 
However, since individual harmonics are analyzed, the amplitude 
signal in each band varies slowly [see (a) of Fig. 5 in Ref. 19], and its 
contribution to the pitch-tooth bandwidth is expected to be small. 
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Accordingly, the modified amplitude and phase signals are given by 


Ag(t) = A;(é), (12a) 
and 
dgr(t) = | qox(7)dr. (12b) 
of) 


It is observed from eq. (12b) that the constant phase term ¢,(fo) is 
discarded [note: ox(t) = Jt, ox(t)dr + ¢x(to)]. This can have an effect 
on the shape of the scaled signal waveform, but because of the relative 
insensitivity of the ear to a fixed phase distortion, it was not judged to 
be perceptually significant.” 

Since in this technique individual pitch harmonics are scaled and 
the interband aliasing is expected to be small, use of the output 
synthesis filters, denoted by w,,(t) in Fig. 2, is less compelling than for 
the ASR technique, but it can still be useful. 

It should be noted that the phase vocoder technique can perform 
time-scale variations of speech signals simply by playing back the 
signal which has been frequency-scaled by a factor g at (1/q)-times 
the original speed. This restores the original frequency range but scales 
the signal’s time duration by gq. This useful property is not shared by 
the ASR technique because of the way the pitch harmonics are shifted 
and because of the nonlinear scaling of the amplitude signals. On the 
other hand, the asR technique can be useful for restoring speech 
distorted by a helium atmosphere, where scaling of the formants 
without changing the perceived pitch of the signal is desired.’* 

We turn now to the more recently developed time-domain harmonic 
scaling (TDHs) technique.* Although this technique is most efficiently 
implemented in the time domain, it was formulated and derived within 
the stFrT framework. 


2.3 Time-domain harmonic scaling 


As noted in the discussion on the phase vocoder technique, com- 
pression factors of up to 3 can possibly be obtained even if the 
bandwidth of each pitch harmonic is not scaled, provided that the 
pitch harmonics are shifted to the correct frequency locations. In the 
phase vocoder this necessitates scaling the phase derivative of the 
STFT so that AQ,, the frequency deviation of the pitch harmonic in the 
kth sub-band from the center frequency w,z, is scaled by g. The 
approach taken by the TDHs technique is to incorporate pitch infor- 
mation which is obtained by a separate pitch detector, into the scaling 
process.’ If the pitch frequency is known, the bandwidth of each 
bandpass filter can be made equal to the pitch frequency and the 
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center frequency of each bandpass filter can be aligned with the 
corresponding pitch harmonic, so that AQ, = 0 for all the bandpass 
filters which cover the speech band. Here, in principle, the number of 
bandpass filters also varies with the pitch frequency and is equal to 
the numberof pitch harmonics in the given speech band. Hence, if 
only shifting ‘of the pitch harmonics is desired, as schematically shown 
in Fig. 3 (for g = 1/2 and q = 2), without scaling the pitch-teeth 
bandwidth, there is no need to modify the modulating amplitude and 
phase signals (i.e. the stFT), but only to scale the carrier, or center, 
frequencies. Thus, 


Xq(wz, t) = X (wr, t), (13) 
with the understanding that w, is chosen to coincide with the pitch 
harmonic Q, in the kth sub-band. Using eq. (13) in eq. (8), and 
assuming that no synthesis filters are used, the output-scaled signal 
yq(t) is given by 


yq(t) = » X (we, thexp(Jquzt). (14) 
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Fig. 3—Schematic spectral representation of frequency scaling by shifting of the pitch 
harmonics.” 
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Equation (14) can be used to derive an explicit linear relation between 
y(t) and x(t) by substituting the right-hand side of eq. (5) for X (wz, ¢) 
in eq. (14). Discretization of this relation yields the TDHS algo- 
rithms,** which were successfully applied in conjunction with several 
waveform coding systems for an effective reduction in the required 
transmission bit-rate of speech signals.*”’ Like the phase vocoder, the 
TDHs technique can also be used for time scaling of speech signals by 
playing back the signal at a different speed or, equivalently, at a 
different sampling rate. 

Before we conclude this section, we would also like to mention two 
additional time- or frequency-scaling systems. One is the scaling sys- 
tem devised in Ref. 24 to which we will refer in a later section; the 
other is the recent cat method to which we already referred in Section 
2.1.°° In the car method a constant Q analysis filter bank is used (i.e., 
the w,(t)’s in Fig. 2 are not identical) which resembles the type of 
spectral analysis performed by the ear. Originally the modification 
consisted of scaling only the unwrapped phase in each band.® However, 
since some bands may contain more than one-pitch harmonic, the 
amplitude signals may vary significantly and need to be scaled as well. 
[See (a) of Fig. 7 in Ref. 19.] As mentioned earlier, the approach 
proposed in Ref. 9 is to analyze the amplitude signal in each band with 
another bank of filters and to scale the phase signals. 


ll. FREQUENCY-DOMAIN HARMONIC SCALING TECHNIQUE 


The basic FDHS technique is given in Ref. 12. In this section, we 
relate this technique to the techniques described in the previous 
section and give its details, including modifications which we introduce 
in the original sign tracking algorithm.” 

As in the phase vocoder, the FDHS technique aims at scaling the 
individual pitch harmonics of voiced speech signals. However, in FDHS, 
the total phase is scaled (including the constant phase term) which is 
discarded in the phase vocoder technique. Also, as in other narrow- 
band techniques the amplitude signals remain unmodified. 

Therefore, the modified STFT amplitude and phase components are 
given by 


Agr(t) = Az(é), (15a) 
and 
dgr(t) = gor(t). (15b) 


This type of modification has been the underlying modification of 
several early techniques which are described and analyzed in Ref. 21. 
The more recent techniques reported in Refs. 24 and 25 are also based 
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on this modification, if only frequency compression or expansion is 
considered. However, the way the FDHs technique performs the phase 
modification is specific to this technique as is elaborated in the 
following. 

For noninteger values of g, the phase modification in eq. (15b) must 
be performed on the unwrapped phase. Otherwise, phase ambiguity of 
a multiple of 27, which results from computing the principal value of 
the phase, may give rise to an incorrect modified phase value. In the 
phase vocoder technique, the difficult task of explicit phase unwrap- 
ping is avoided by directly computing the phase derivative from the 
real and imaginary components of the stFT and their time derivatives.” 
In the AsR technique, the scaling factors are restricted to g = 1/2 for 
compression and g = 2 for expansion.® The explicit phase division by 
2 for g = 1/2 is then avoided by expressing the scaled signal in each 
band in terms of the input signal and its envelope (amplitude function), 
with a sign which is determined by means of a simple sign tracking 
algorithm.’ 

Since compression and expansion by a factor of 2 is of most practical 
interest (speech quality at higher scaling factors degrades rapidly), the 
approach of using 2:1 scaling and avoiding explicit phase computation 
and unwrapping, by means of a proper sign tracking algorithm, is 
adopted also by the FDHs technique presented in Ref. 12. We now 
present the details of the FDHS technique and the modifications intro- 
duced in the original sign tracking algorithm. 

Let az(¢) and b,(t) be the real and imaginary parts, respectively, of 
the input signal sTFT, X (w,, f), and @q:(t), b,2(t) the corresponding real 
and imaginary parts of the modified STFT X,(wz, t). From eqs. (6) and 
(15) we find, 


ax(t) = Az(t)cosdz(t); Aga(t) = Ax(t)cos[g¢x(t)]. (16a) 
b(t) = Az(t)sinds(t); bgx(t) = Ax(t)sin[g¢x(t)]. (16b) 


By restricting q to be 1/2 or 2 and using basic trigonometric relations, 
the following algorithms were derived in Ref. 12. 


3.1 Compression (q = 1/2) 
Using half-angle trigonometric relations, one finds from eq. (16) 
axj2(t) = SGN[az,2(t) ]{[Az(t)/2][Az(t) + a2(é)]}"”, (17a) 
by a(t) = SGN[be/2(t)]{[Ax(t)/2][Ax(t) — ax(t)]}}', (1 7b) 


where the signs [SGN(-)] are determined according to the quadrant in 
which the complex vector (or phasor) [az/2(t) + jbz/2(t)] is present at 
any given time instant ¢. Once initialized, a consistent sign, or quadrant, 
tracking algorithm is given by the following rules or logic.” 
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Fig. 4—Demonstration of the validity of the change of sign rules in eq. (18): (a) 
SGNf ag alt + At)] = -—SGN[az,2(t)]; (b) SGN[bz/2(¢ + At)}] = — SGN[bz/2(t)]. 
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where A and B are logical variables such that 
A is true if a,(t + At) <0, 
B is true if SGN[6,(¢ + At)] = — SGN[Ox(¢)]. 


The validity of these rules is demonstrated in the phasor diagram 
shown in Fig. 4. In this figure, the solid-line vectors represent the input 
STFT at times ¢ and ¢ + Ad, with a corresponding phase change of 
Ady = AQ,At, where AQ, is the deviation of the harmonic from the 
center frequency w;. The dashed-line vectors represent the modified 
STFT and rotate at half the angular velocity. In (a) of Fig. 4 we illustrate 
a situation in which the input vector crosses the real axis; that is, 
condition B is “true.” Since the real part of the input vector is negative 
(i.e. condition A is also “true’”), the rule in eq. (18a) states that the real 
part of the modified strT changes its sign during the time interval At, 
as is indeed the case—the dashed-line vector crosses the imaginary 
axis. In the same way, (b) of Fig. 4 illustrates a situation in which the 
imaginary part of the modified strT changes its sign in agreement with 
the rule in eq. (18b). Similar diagrams can show that the rules are valid 
even if the direction of rotation of the phasors shown in Fig. 4 is 
reversed.* 


* The direction of rotation depends on the location of the pitch harmonic with respect 
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Originally, it was proposed to initialize the sign tracking algorithm 
described above by assigning positive values to the signs of az,/2 and 
bp/2 in all bands,” or for that matter, any arbitrary assignment will do 
as well. However, further analysis shows that such an initialization can 
result in shifting some pitch harmonics to incorrect frequencies and 
thereby breaking up the harmonic structure. This situation results 
when the signs of a@z/2 and bz/2 in a given band happen to be initialized 
such that the sign of one is correct and the sign of the other ‘is re- 
versed. The sign tracking algorithm in eq. (18) will then Gries The 
scaled signal vector (phasor) to reverse its direction of rotation. This 
can be verified with the help of a phasor diagram such as in Fig. 4. For 
simplicity, let the input signal be of the form Aexp(jQot), such that 
Qo = we + AQ, where w, is the center frequency of kth band, and AQ 
=< Aw (the bandwidth of that band). Then, the inversion of rotation 
direction discussed above will result in a scaled signal of the form 
Aexp[j(w:/2 — AQ/2)t] = Aexp[j(Qo/2 — AQ)t] instead of the desired 
signal Aexp[7(w,/2 + AQ/2)t] = Aexp[j(Qo/2)t]. The situation can be 

even worse when two adjacent filters share a single harmonic;* that is, 

the harmonic lies in the transition bands of the two filters as schemat- 
ically shown in (a) of Fig. 5. In this case, the following conditions may 
arise: (t) The sign initialization in both bands is correct so that the 
components from the two bands sum up to 


Ajexp[J(w1/2)¢ Jexp[j(AQ1/2)t] + Asexp[7(w2/2)t]exp[—7(AQ2/2)£] 
= (Ai + Ao)exp[jo/2)¢], 


where w; and we: are assumed to be the center frequencies of the two 
bandpass filters under consideration, AQ; and AQ: are the deviations of 
the given harmonic, of frequency %o, from w; and w., respectively, and 
A; and Ag: are the corresponding signal amplitudes in each band. For 
a uniform filter-bank design, Ai + A2 = A, where A is the input signal 
amplitude. This is the desired result as depicted in (b) of Fig. 5. (zi) 
The sign initialization in one band, say the second one, is incorrect but 
such that only one of the signs of az,2 or bz,2 is reversed. As discussed 
earlier, if we use phasor description, this will reverse the direction of 
rotation of the scaled component in that band and will result in 


Axexp[_j(w:/2)t]exp[ j(A%:/2)t] + Arexpf j(w2/2)t Jexp[ j(AM2/2)t] 
= Ajexp[ j(%%/2)t] + Acexp[j(Qo/2 + AQ2)¢]. 


to the center frequency «, of the sub-band in which it is located (i.e. on the sign of 
AQ,). If AQ; > 0, the phasor rotates in a counterclockwise direction, as in Fig. 4, whereas 
if AQ: < 0, its direction of rotation is reversed. 

* It is assumed that the side lobes of the filters in the filter bank are sufficiently small 
so that only two adjacent filters which share the same harmonic are considered. 
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Fig. 5—Schematic representation of the effect of sign initialization on frequency 
compression (q = 1/2) of a harmonic shared by two adjacent filters. (a) Original 
harmonic. (b) Condition (i): correct sign initialization. (c) Condition (11): sign inversion 
in one band of one of (a@z/2, bz/2). (d) Condition (iii): sign inversion in both bands of one 
of (az/2, bz/2). (e) Condition (iv): sign inversion in one band of both (ax/2, bx;2). 


This result is shown in (c) of the figure. (iii) The sign initialization in 
both bands is incorrect, and in both bands it is such that only one of 
the signs of az;2 or bz/2 is reversed. In this case, as shown in (d) both 
components are shifted to incorrect locations. (iv) The signs in one 
band, say the second one, are initialized incorrectly, such that both 
signs of az/2 and b,/2 are reversed. In this case the rotation direction 
can be shown to remain unchanged, but there is a 7 phase shift in the 
phase of the scaled component in this band. Hence, when the two 
components from the two adjacent bands are recombined in the 
synthesis stage, the following scaled signal results 
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(Ay — Ao)exp[j(Mo/2)¢]. 


This is shown in (e). As will be discussed later, designing the filters for 
minimal overlap, or narrow transition bands, mitigates the amplitude 
interactions. (v) The signs in both bands are initialized incorrectly, but 
in both bands both signs of az/2 and b;,2 are reversed. In this case there 
is a 7 shift in both bands and this results in a constant 7 phase shift in 
the recombined scaled signal as well, resulting in 


—(A; + A2)exp[y (Qo/2)t] = —Aexp[j(Mo/2)t]. 


This situation is of no concern. It may affect the compressed signal 
waveform but is not found to have any perceptual effect (if we listen 
to the time compressed version of the signal). Furthermore, in the 
expansion process the phase is multiplied by 2 and this constant phase 
shift is cancelled. Similarly, if the harmonic is located within a single 
band, a sign inversion is produced in the scaled harmonic, which is of 
no concern. 

We have seen that certain errors in sign initialization can have an 
adverse effect on the resulting scaled signal, and particularly so when 
a harmonic is shared by two filters. The best approach for initialization 
would appear to be the use of phase unwrapping along the frequency 
axis at the time of initialization, so that correct initial signs are assigned 
to each band. However, this is generally a difficult and error prone 
task, which typically requires a high-frequency resolution.”* Also, since 
speech is nonstationary, reinitialization is needed quite often and a 
complex phase unwrapping technique will defeat the whole purpose of 
using a simple sign tracking algorithm. Therefore, we have examined 
the implications of using the principal value of the phase in each band 
for assigning the initial signs of az/2 and bz/2. To do this, there is no 
need to actually compute the phase but only to examine the sign of 
b,(t) at the instant of initialization. Since the principal value of the 
phase is assumed to be in the range 0 to 27, the initial position of the 
vector [az/2(t) + 7bx/2(t)] must be assumed to be in the first or second 
quadrant. Hence, a sign initialization according to the principal value 
of the phase is given by the following simple rules: 


SGN[az/2(to)] = = rear oe (19a) 


SGN[bi/2(to)] = 1. (19b) 


An analysis of this sign initialization shows that either both signs of 
Qx2 and by/2 are correct or both are reversed. If the harmonic is located 
within a single filter, the scaled harmonic will be shifted to the correct 
frequency with a possible phase shift of z — which is of no concern. 
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However, if the harmonic is shared by two adjacent filters, the sign 
initialization according to eq. (19) can give rise to conditions (iv) and 
(v) discussed above. In summary, we face a problem only when a 
harmonic is shared by two adjacent filters and condition (iv) exists. As 
shown in (e) of Fig. 5, under this condition the harmonic is shifted to 
the correct frequency location but its amplitude is generally attenuated 
and it can even be canceled, if A; = Ae. This possibility of attenuation, 
or even cancellation, is still of concern. After further study of the 
problem, we found it possible to devise a procedure for matching the 
signs in the two bands so that no attenuation of the scaled signal will 
occur. (A z phase shift of the recombined harmonic is still possible, 
but as explained earlier this is of no concern). We now explain this 
sign matching procedure. 

For the purpose of simplifying the explanation, consider again a 
single input complex tone at frequency Q, of the general form 
Aexp[j(Qot + 0) ], with Qo satisfying w; < Qo < w2, where w; and we are 
again the center frequencies of two adjacent bands. Following phase 
scaling and sign initialization according to eq. (19), the modified sTFT 
signals X1(t) = X,(w:, t) and Xo(t) = X_(we, t) (with g = 1/2) are given 
by 


Xi(t) = Sx,Aiexp[J(AQit + do) /2], 
alt) = SeAvexp[—j(AMot — do)/2], 


where A; and A: are the magnitudes of the components in the two 
bands. Further, A; + Az = A, AQi = Qo — 1, AQe = we — Qo, and 
S,,, Sx, can only take the values of (+1) or (—1) and are used to denote 
a possible sign inversion because of an incorrect sign initialization. 
Therefore, the problem is to make S;, and S,, equal to each other so 
that the two components will recombine without attenuation, a sign 
inversion being inconsequential. To perform this matching of 
S,, and S,, we examine the function R(¢) given by 


R(t) 4 Xi(t)/X2(t) = Sr(Ai/A2)exp[j(Aw/2)e], (20) 


where Aw = w2 — w, = AQ, + AQ, is the fixed frequency separation 
between the centers of the two given bands, and 


gs, a] 1 ifS,=S,, 
B | -1. if Si, S:,' 


Our goal then is to have a sign initialization for which Sr = 1. Since 


R(t) is independent of the unknown value of Qo, we can find the value 
of Sr by examining R(t) at any of the specific time instants ¢t = ¢, = 
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nT» = n(27/Aw), (where n is an integer) which results in 


R(tn) = Sr(A1/Az2)(-1)”. (21) 
Hence, Sz can be determined from* | 
Sr = SGN[R(t,)](-1)". (22) 


In practical situations where the filters are not ideal, such as when 
leakage from other bands is present, and the signal is not purely 
periodic, R(t) at ¢ = ¢, is not necessarily real as is implied in eq. (21). 
Therefore, we use the sign of the real part of R(t,); that is, 


Sr = SGN[Re{R(é,)} ](—1)”. (23) 


To evaluate the right-hand side of eq. (23), there is no need to fully 
compute R(t,), as shown below. Let Xi(tn) = a1 + JB: and Xo(t,) = 
a2 + jB2—for convenience, the explicit time dependence of a, fi, a, 
and £2 is suppressed. Then, by the definition in eq. (20), 


SGN[Re{R(é,)}] = SGN(ara2 + B82). (24) 


Hence, assuming that we know two bands which share a single 
harmonic, the complete initialization procedure (at an appropriate 
time instant ¢, = nTo) consists of first initializing the two signs of az/2 
and bz,/2 according to eq. (19) and then evaluating Sr from eqs. (23) 
and (24). If Sr = 1, the two components are matched. If Sr = —1, the 
signs in one band should be inverted. If the signal is stationary, the 
_above initialization procedure needs to be done only once as the correct 
sign will continue to be tracked by the sign tracking algorithm in eq. 
(18). However, even for the stationary case one has to first determine 
which are the two bands which share a single harmonic because, in 
general, both bands on the two sides of the band to be initialized may 
contain signal components. One way to find out which is the correct 
pair of bands is to compute the phase of R(t,) for each of the two 
bands and pick the band which has the smaller phase angle. Again, 
there is no need to explicitly compute the value of the phase of R(¢,) 
for the two bands in question, but it is sufficient to compute the ratio 
(21 — 0182) /(aia2 + 8:82) for each of the two possible pairs, and pick 
the pair for which this ratio has the smaller magnitude. However, in 
simulations we have found that, given sufficient frequency resolution 
(relative to the separation between harmonics), it is also adequate to 
simply choose the band in which the signal magnitude is larger. 

In addition to the above pairing issue, since speech is nonstationary 


* By choosing ¢, to be a multiple of 27, the term (—1)” can be avoided. However, for 
better tracking of pitch frequency variations we prefer to reduce the time interval 
between possible sign initializations to To. 
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one is also faced with the problem of detecting the following conditions: 
onset of speech, transitions from unvoiced to voiced, and the appear- 
ance of a pitch harmonic in a band because of pitch variation. In all of 
these cases, the signs should be reinitialized according to the procedure 
discussed above. We have found in simulations that satisfactory au- 
tomatic tracking of the above conditions is obtained by reinitializing 
the signs in any band k for which 


|X (wa, tr) |/| X (we, tri) | = Ex, (25) 


where E77 is a preset threshold (typically, as elaborated in Section V, 
it is set to correspond to an energy increase of 10 dB in the time 
interval J> = 16 ms). The mechanization of this initialization process 
is further detailed in Section 5.1. 


3.2 Expansion (q = 2) 


From eq. (16) we have 


d2z(t) = Az(t)cos[2¢,(t)], (26a) 
and 
bo,(t) = Az(t)sin[2¢,(t) ]. (26b) 
Using double-angle trigonometric relations and eq. (26) one finds, 
Gox(t) = {2[ax(t)]}’ — [Ax(t)P}/Aalé), (27a) 
and 
bop(t) = 2 az(t)b,(t)/Az(t), (27b) 


where it is assumed that Az(t) ¥ 0. If A,(¢) = 0, then, directly from eq. 
(26), Qoz(t) = boe(t) = 0. 

It is seen that because the phase is multiplied by an integer (gq = 2), 
a phase ambiguity which is a multiple of 27 is of no concern in 
frequency expansion. Note also that if the compressed signal is ex- 
panded (for signal reconstruction), Az, az, and bz, should be replaced 
in eq. (27) by Azz, Azz, and bz,/2, respectively. 

In comparing the sTFT modifications, as expressed by eq. (15) for 
this technique and by eq. (10) for the asR technique, one may question 
if it is useful to use eq. (10a) in place of eq. (15a). The answer is no. 
The reason is that, again, with practical filters a pitch harmonic may 
be shared by two adjacent filters. Hence, if a nonlinear modification, 
~ such as that in eq. (10a), is applied to the amplitude signal in each 
band, the two signals (which are components of the same harmonic) 
will in general not recombine to the correct magnitude. Again, since 
the analysis is narrow-band the scaling of the amplitude signal is 
generally of secondary importance. 
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IV. EFFICIENT DISCRETE-TIME IMPLEMENTATION 


The basis for the discrete time implementation of the FDHS tech- 
nique is the general block diagram for frequency scaling in Fig. 2. 
However, in this form (following discretization) it is highly inefficient 
because a large amount of computation is needed to perform the 
filtering analysis and synthesis operations. It has been demonstrated 
in several works'*-!*”* that a discrete Fourier transform (DFT) formu- 
lation, using the FFT algorithm, can be used for STFT analysis and 
synthesis with its accompanying large saving in computation. We 
particularly found the weighted overlap-add (WOLA) method given in 
Ref. 14, to be most suitable for our application. However, we had to 
extend the particular scheme given in Fig. 2 of Ref. 14 to accommodate 
situations in which both the analysis and synthesis windows (the 
prototype low-pass filter impulse responses) have durations longer 
than the transform size N. This arises from the need to use analysis 
filters with narrow transition bands, and, hence long duration, when 
performing frequency compression. This design minimizes filter over- 
lap and lowers the probability of obtaining harmonics shared by 
adjacent filters, and, therefore, lessens the effect of possible incorrect 
sign initialization in such bands. 

We begin by showing that a discrete-time version of the block 
diagram in Fig. 2 of this paper has the basic form considered in Ref. 
14, so that the woLA scheme developed is indeed suitable for our 
application.” 

Figure 6 shows a discrete-time form of the block diagram in Fig. 2 
which is made to match Fig. 3 in Ref. 14 (with somewhat different 
notation). This requires some clarification, which is given next. 

The discrete-time input signal x(m7J') represents samples of x(t) 
which is assumed to be sampled at or above its Nyquist rate, with T 
being the sampling interval. The signal band is divided into N equally 
spaced contiguous sub-bands with center frequencies w, = 27k/(NT), 
k=0,1,---,N-—1. The discrete-time STFT, X (w;, nT’), is obtained for 
each value of k, by modulating x(nT) by exp(—jw.nT) = Wy", where 


Wn A exp(j27/N), (28) 


and filtering the modulated signal by the low-pass filter h(nT), which 
is the sampled version of w;(t). Here all N prototype filters w,(t) are 
identical. Since h(nT) is approximately band-limited to Aw/2 = 7/NT, 
its output signal can be decimated. To reduce frequency-domain 
aliasing, and because of considerations related to the sign tracking 
algorithm used by the FpHS technique, the decimation factor R, an 
integer, typically satisfies R < N. The decimated sTFT, X (w,, nRT), is 
modified according to the modification algorithms of the FDHS tech- 
nique, using discretized forms of eqs. (17) and (27) for compression 
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Fig. 6—Discrete-time form of the block diagram in Fig. 2, including decimation and 
interpolation of the sub-band signals. 


and expansion, respectively. The modified sTFT, X,(wrz, nRT), can be 
considered to be the sTFT of the desired scaled signal at the scaled 
center frequency qu», and as being sampled at a rate which corresponds 
to the scaled signal bandwidth, i.e. with a sampling interval T’ = T/q. 
It is suitable, therefore, to rename X,(wz, nRT) as Y(quwz, nk’T’), 
where R’ is related to R through the requirement R’T’ = RT (i.e., R’ 
= gR), so that the original time scale is maintained. For synthesizing 
the output scaled signal from the decimated and modified STFT signals, 
Y(qux, nR’T’), k = 0, 1, -»-, N — 1, these signals must first be 
interpolated by a factor R’. The interpolation is done in the scheme 
shown in Fig. 6 by inserting (R’ — 1) zeroes between adjacent samples. 
This is represented by the box labeled 1:R’ in Fig. 6. The result is 
processed with a low-pass filter f(nT’) having a nominal bandwidth of 
a/(NT’) 

tes the interpolation is not ideal, we denote the interpolated STFT 
by Y(qun, nT’). Modulation of the base-band signals with the complex 
sequence exp(jqw,nT’) results in the complex bandpass signals 
Zqx(nT’), R= 0,1, ---, N — 1. Using w, = 2ak/NT and T’ = T/q, we 
note that exp(jqw,nT’) = Wx, where Wy is defined in eq. (28). Thus, 
the input and output modulating sequences are complex conjugates of 
each other and are identical to the discrete transform kernels used in 
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the DFT. Finally, by summing the N complex bandpass signals the 
frequency scaled output signal y,(nT’) is obtained. 

Thus, we have seen that the general block diagram given in Fig. 2 
can be implemented in the discrete-time form shown in Fig. 6. From 
the identity between this figure and Fig. 3 in Ref. 14, the more efficient 
WOLA scheme presented in Ref. 14 can be directly applied for the 
implementation of the block diagram in Fig. 6. However, as mentioned 
earlier, it is import t that frequency compression with the FDHS 
technique be performed with long duration window functions (longer 
than N). Hence, the scheme shown in Fig. 2 of Ref. 14 needs to be 
modified to accommodate this paticular situation. For clarity of pres- 
entation, and since the scheme shown in Fig. 2 of Ref. 14 can be used 
for expansion without any change, we briefly explain this scheme 
before we show its modification. 

For convenience, we denote the analysis and synthesis window 
sequences by h(n) and f(n), respectively, dropping the explicit sam- 
pling intervals used in the previous notation. We now duplicate in Fig. 
7 the scheme shown in Fig. 2 of Ref. 14, with some changes in notation 
to match the notation used in this paper. 

According to this scheme, the input data samples are shifted into an 
input data buffer of length N, R samples at a time, corresponding to 
the R:1 decimation shown in Fig. 6. The input data block is weighted 
by h(—n), which has its origin at the center of the block (see Fig. 7). 
The weighted data block is transformed using the FFT algorithm. The 
resulting STFT has its time reference at the beginning of the block (i.e., 
a sliding time reference); hence, a linear phase shift, corresponding to 
the time interval between the fixed time origin, say, f = 0, and the 
beginning of the transformed data block, must be introduced. In 
addition, since the time origin of the analysis window is at the center 
of the block, a circular rotation of the weighted data by N/2 points is 
also needed. Performing this rotation by phase modification in the 
frequency domain results in an overall phase modification by 
(—1)*Wn™, k = 0, 1, ---, N = 1, as shown in Fig. 7. The result is the 
desired discrete-time STFT, X (w,;, nT), in a fixed time reference, which 
is to be modified for frequency scaling. The modified sTFT 
Y(qwrz, nR’T’) is translated back to the sliding time reference by 
applying the complementary phase modification (—1)*Wi"”. The out- 
put scaled signal is obtained by inverse transforming the modified STFT 
(in the sliding time reference), weighting the resulting data block by 
the synthesis window f(r), and overlap-adding the weighted block to 
the output buffer. The output buffer shifts out R’ samples for every R 
samples that are shifted into the input buffer. Also, to facilitate the 
overlap-add operation, R’ zeroes are shifted into the output buffer as 
the processed signal samples are shifted out. Since the input and 
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Fig. 7—A wo.a block implementation scheme for short-time Fourier analysis, mod- 
ification, and synthesis." 


output sampling intervals are T and T’, respectively, the time scale is 
not altered and the flow of data is uninterrupted. However, if time 
scaling is desired, the output data can be stored first and then replayed 
at the original sampling rate, resulting in a time-scaled signal (by the 
factor g = R’/R) which occupies the original frequency band. 

We turn now to the more general situation in which h(n) and f(n) 
have longer durations than N samples. Let L, and L; denote the 
durations of A(n) and f(n), respectively, and let L, = m,N, Ly =m;N, 
where m, and my; are positive integers. (If necessary, zeroes can be 
appended to the impulse responses to satisfy these conditions.) The 
appropriate scheme for this case is shown in Fig. 8 and is explained 
below. 

It has been established in earlier works, as well as repeated in 
Ref. 14, that if L, > N, the N data points to be transformed are 
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Fig. 8—Modified implementation scheme to accommodate analysis and synthesis 
windows which are longer than the transform block size N. 


obtained by time aliasing the weighted L, data points into N points. 
This can be seen as a stack-add operation of the m, data segments, 
each of duration N, as shown in Fig. 8. Because of the stacking 
operation, the time origin of the transform is seen to be aligned with 
the time origin of the data window, and, hence, no circular rotation by 
N is needed; that is, there is no need to multiply the transformed data 
by (—1)*. As before, the phase modification by Wy"”* is needed to 
obtain X(w,z, nRT) in the fixed time reference. Following the sSTFT 
modification, the conversion of the sliding time reference, and the 
inverse transformation, N data points are obtained to which the 
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synthesis window of duration L; = m;N is to be applied. To do this, 
we periodically repeat the given block of N data points m; times.* To 
be consistent with the analysis window weighting, the center of the 
synthesis window is aligned with the beginning of the given data block, 
as shown in Fig. 8. The weighted data is then overlapped-added to the 
output data buffer as before. 

Before we conclude this section, several comments are in order. 
First, it should be noted that the scheme in Fig. 8 can be used also for 
the case considered in Fig. 7; that is, when the windows have a length 
which is less or equal to N. This is done by appending zeroes on each 
side of each window so as to make the duration of each 2N (i.e., my 
= my, = 2). The stack-add operation will provide the circular shift by 
N/2 points, as needed in Fig. 7, and hence eliminate the need for 
multiplying by (—1)*, as is the case in Fig. 8. 

Second, it is observed that the implementation schemes presented 
above can, in principle, be used for scaling an input signal by any 
rational factor R’/R. However, for frequency compression with q = 
1/2, the FDHS technique provides a particularly efficient realization of 
the STFT modification because the sign tracking algorithm avoids the 
need for explicit phase unwrapping. Expansion by integer factors is 
the simplest operation because the principal value of the phase can be 
used and the filter design considerations are simple. For other rational 
scaling factors it appears that one has to resort to phase unwrapping 
with its attendant complexity. 

Third, in noncoding applications it may be desired to obtain a 
frequency compressed signal at the original sampling rate, i.e., over- 
sampled. The needed interpolation can be embedded in the above 
implementation schemes. This is done simply by enlarging the modi- 
fied STFT transform size (1/g times) by padding with zeroes (at the 
center of the transform block). Following the inverse transform (IDFT) 
and the weighting by a suitable f(m) of the longer data block, it is 
overlapped-added to the output data buffer. The data in both the 
input and output data buffers is accordingly shifted by R samples at a 
time. 

Finally, it is of interest to point out that the scheme shown in Fig. 
8 offers a generalization of the TDHS technique‘ as explained below. 

Let us apply the principles of THs, as elaborated in Section III, to 
the scheme in Fig. 8. In principle, this means using a pitch-adaptive 
analysis filter bank—N is made equal to the pitch period and h(n) is 
varied accordingly—and applying no modification to the STFT, except 
for scaling the center frequencies, as expressed by eq. (13) in Section 
III. Since the sTFT is not modified, there is actually no need to use a 


* This takes into account the underlying periodicity in the IDFT result because of the 
discretization in frequency. It can also be concluded from eq. (14) in Ref. 14. 
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transform—the phase shifts can be done by circular rotations in the 
time domain. The part of the scheme in Fig. 8 between the stacking 
and the weighting by f(r) collapses to a counter clock wise circular 
rotation in the time domain by (R’ — R)n Mod N, samples, where N, 
is the pitch period. The implementation scheme of the generalized 
TDHS technique is shown in Fig. 9. 

The TDHS algorithms in Ref. 4 correspond to specific choices of R, 
R’, and f(n). For example, for 2:1 compression the values are R = 2, 
R’ = 1 and f(n) is a unit impulse; that is, f(n) = 6(n). The new 
generalized TDHS scheme above offers more flexibility through the 
possible use of more general synthesis windows, f(n). It also could 
provide means for additional interband filtering such as w,,(t) in Fig. 
2, which could improve the quality of the scaled speech signal, although 
at the expense of additional computation. This generalization of the 
TDHS technique is now under further investigation. 


V. DESIGN CONSIDERATIONS AND SIMULATIONS 


The implementation scheme in Fig. 8 applies both to compression 
and expansion. However, the type of frequency scaling performed 
affects the design requirements for the analysis/synthesis system. For 
this reason, we discuss below the design of the compression and 
expansion systems separately. 
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Fig. 9—Implementation scheme of the generalized TDHS technique. 
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5.1 Compression system design 


A basic assumption in the development of the FDHS technique is 
that each filter in the analysis filter bank contains no more than one 
pitch harmonic. Hence, the number of filters in the filter bank should 
equal or exceed the largest number of harmonics expected in the given 
speech band. To accommodate low-pitch male speakers, we have 
chosen the number of filters to be N = 128. With a sampling rate of 8 
kHz, which is typical for telephone bandwidth speech, this corresponds 
to anominal frequency resolution of 62.5 Hz. This may not be sufficient 
resolution for very low-pitch speakers but was chosen as a compromise 
because of another conflicting requirement elaborated below. The 
prototype low-pass filter h(n) has, therefore, a nominal bandwidth of 
31.25 Hz. Besides the usual requirement that the transform of h(n) 
have low side lobes to minimize interband leakage, it is extremely 
important that the transition band of h(n) be as narrow as possible to 
minimize the overlap between the bandpass filters. The importance of 
minimizing the overlap stems from the difficulties in identifying bands 
which share a harmonic with an adjacent band and in matching the 
signs in those bands to avoid attenuation of the corresponding har- 
monic, as we discussed in detail in Section 3.1. By reducing the overlap 
between the filters, the probability of occurrence of this condition is 
lowered. In addition, it is also lowered by using the minimum number 
of filters necessary to separate the pitch harmonics. For this reason, 
we did not increase the transform size to 256 and have chosen N to be 
128. For female voices, the best results are obtained with N = 64. 
However, since WN is fixed the value of N = 128 was found to match 
both male and female voices. Other considerations in designing h(n) 
are as follows. 

In principle, one can reduce the transition-band width by increasing 
the filter length L;,. However, since speech is not a stationary signal 
and has a typical quasi-stationarity interval of several tens of ms, the 
length of h(n) should be limited accordingly. Therefore, we have 
limited its length to 512 samples, 1.e., to a duration of 64 ms for 8-kHz 
sampling rate, with the main lobe duration being 32 ms. With L, = 512 
we have m, = L;,/N = 4 and, for this reason (mz > 1), it was necessary 
to develop the scheme in Fig. 8, as an extension of the scheme in Fig. 
7 from Ref. 14. To meet the requirements for low side lobes and a 
narrow transition band, we have chosen h(n) to be an optimal equirip- 
ple filter?’ and designed it with the filter design package” available on 
our computer system.* 


* The maximum filter length that can be designed on our system is 511. A 512-point 
filter is formally defined by appending a zero. 
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An additional requirement on the analysis filter bank is that its 
overall response be uniform. This way the two components of a 
harmonic which is shared by two filters will sum up to the correct 
magnitude. In terms of h(n), a necessary and sufficient condition for 
the filter bank to be uniform is” 


nian) = {5 n=0 


0 n¥0° oy) 


While this condition is easily satisfied by windowing for example a 
sin(x)/x type response, the optimal equiripple filter does not necessar- 
ily satisfy this condition. Therefore, we had to use a trial and error 
approach and repeat the design several times until our design goals 
were met. Figure 10 presents the results of the final design which we 
denote by Ao(n). The analysis window ho(n) is shown in (a) of Fig. 10, 
its frequency response in (b), and the composite filter bank frequency 
response in (c). The peak-to-peak ripple of 0.2 dB in the composite 
response is quite acceptable. The composite frequency response of a 
filter bank having a prototype low-pass filter h(n) is simply found by 
transforming the sequence d(n) defined by 


_ Jh(n) ifn Mod N=0 
d(n) = {° otherwise : (30) 


Before we consider issues related to the modification of the STFT 
signals, we would like first to consider the issues involved in designing 
a synthesis window f(n) that provides an adequate reconstruction of 
the input signal when no spectral modifications are performed. Since 
we deal with the case where L;, > N (i.e., mp; > 1), the synthesis filter 
has the difficult task of undoing the time aliasing, because of the 
stacking operation, as well as the frequency aliasing, because of the 
decimation of the signal in each band by the factor R. 

It was shown in Refs. 16 and 24 that for exact reconstruction the 
following relation between h(n) and f(r) should hold 


Y f(n — sR)h(pN —n+ sR) = 6(p), for alln, (31) 
where 6(p) = 1 for p = 0 and zero otherwise. If we assume that N is 
divisible by R, then eq. (31) is periodic in n with period R and 
constitutes a set of R conditions (for nm = 0,1, --- , R — 1). In particular, 
for R = N, it is seen from eq. (31) that the design of f(n) is equivalent 
to the problem of designing N inverse sub-filters.°** Since h(n) and 
f(n) are both of finite duration (FIR), exact reconstruction cannot be 
obtained. However, if R < N there is less aliasing in the frequency 
domain and the problem is relaxed. In view of the equivalent scheme 
in Fig. 6, we expect that for R < N the use of a reasonably good 
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Fig. 10—Designed analysis prototype filter 4o(7). (a) Impulse response. (b) Frequency 
response. (c) Composite filter-bank frequency response. 


interpolation filter will also result in an acceptable reconstruction 
error. Therefore, we have initially selected four interpolation filters 
which have simple analytical respresentations. This facilitates the 
variation of their bandwidth by a change of a parameter. These 
windows are symmetrical and can also be made to have the usually 
desired property of interpolation filters, namely, 


f(nR) = i vie (32) 


The four synthesis window functions considered are a rectangular 
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window [wr(n)], a Hanning window [wz(n)], a sin(x)/x function 
multiplied by a Hanning window [wus(n)], and a special window 
function [wy;(n)] which was derived in Ref. 4 and has some interesting 
properties.* The analytical representations of these windows are 


ste f In| <L. 


0 |n|=L’ 
wy(n) = ae + cos(zn/L)] |n|sL, 
33) 
0 Inj>L 
1 
wus(n) = 45 [1 + cos(27n/L,) | 
0 
-[sin(mn/L)/(an/L)] |n| < L;/2 ri 
|n| > L,/2° 
and 
cL, 
wa(n) = To ene |2| = L,/2 , 
0 \n| > L;/2 (35) 


where L is an integer which determines the bandwidth of the synthesis 
window and Ly is its length, which is assumed to be an even multiple 
of L. In particular, the rectangular and Hanning windows in eq. (33) 
have a duration of L; = 2L. Note that if L is chosen to be equal to R, 
then eq. (32) is satisfied. The above four window functions are shown 
in Fig. 11 for several values of L,/L. The reconstruction error expected 
from using these synthesis windows, with the analysis window h(n) = 
ho(n) shown in Fig. 10, was found by evaluating the left-hand side of 
eq. (31) and computing its deviation from the desired value on the 
right [6(p)]. An average mean-square error (mse) was defined as 
follows. Let V,,(p), n = 0, 1, --- , R — 1 be defined by the left-hand 
side of eq. (31) under the assumption that N is an integer multiple of 
R and let 


e,AY> [V.(p) —- G8(p)]?? n=0,1,---,R-1, (36) 
Pp 


* This window is described by eq. (53) in Ref. 4 (even case) and has the interesting 
property of satisfying an equation like (32) in both the time and frequency domains. 
Thus, as a prototype filter it results in a uniform filter- bank response, whereas in a 
WOLA-type operation it results in a uniform time response [see eq. (52) in Ref. 4]. For a 
a de choice of parameters, it becomes identical to the well-known Hanning win- 

ow. 
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Fig. 11—Synthesis window functions. (a) Rectangular [wr(n)] and Hanning [wi(n)] 
aa (b) wy(n) and wys(n) windows for L; = 4L. (c) wy and wus windows for L; = 
8L. 


where G is a constant given by 
1 R-1 
=— V,(0), 37 
R x (0) (37) 
and, in general, is not equal to 1 as assumed in eq. (31). We have 
introduced G since we consider a reconstruction which differs from the 
original signal by a constant gain term as being errorless. An average 
mse is now defined by 


f’=— Ye. (38) 


It can be shown that the choice of G according to eq. (37) minimizes 
e’; that is, de?/8G = 0. The values of €”, in dB, which were obtained for 
the above synthesis windows, using the analysis window fo(n) are 
shown in Fig. 12 as a function of R, which is given in terms of the 
transform size N. For each value of R, e” was computed for different 
values of L. The minimum value of e* so obtained is shown in Fig. 12. 
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SYNTHESIS WINDOWS 
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AVERAGE MSE €? IN DECIBELS 
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R 


Fig. 12—Minimum values of the average mse e’ for different synthesis windows, 
paired with the analysis window ho() in (a) of Fig. 10, as a function of the decimation 
factor R, or input data shift. N is the transform block size. 


In fact, for the above windows, the minimum is attained when L = RB; 
that is, eq. (32) is satisfied. 

Note also that in Fig. 12 the length of the synthesis windows wr(n) 
and wz(n) varies with R, since L = R and L; = 2L. However, the length 
of wus(n) and wy(n) is assumed to be fixed—either L; = 2N or Ly = 
4N—but the bandwidth still varies with R since L = R. It is seen from 
Fig. 12 that with all four windows R must be less than N. To save 
computation we wish, of course, to use the largest possible value of R. 
For R = N/2 and L; = 4N = 512 the preferred window is wys(n), 
whereas for R = N/2 and L; = 2N it is wy(n) (in both cases L = R = 
64). To reduce the output buffer size and to save computation, we 
preferred using L;= 2N. The resulting reconstruction error (below —30 
dB) is quite acceptable. The other two windows [wr(n) and wx(n)] 
result in reconstruction errors too high to be useful with this value of 
R. 

The above results are for an analysis-synthesis system without the 
modification of the sTFT signals. Since the frequency scaling affects 
the output sampling rate (the interpolation factor becomes R’ = gR, 
where q is the scaling factor) and the sign tracking algorithm limits 
the maximum value of R (as elaborated later), the choice of R and the 
synthesis window function must be carefully reconsidered to account 
for the spectral modification. 
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For proper operation of the sign tracking algorithm, the phase in 
each frequency cell or band should not change by more than 7/2 
between observations in order that the crossing of the complex signal 
vector from one quadrant to another will be accounted for and the 
signs of its real and imaginary parts be tracked correctly. The maxi- 
mum phase change occurs when a harmonic has the largest deviation 
AQmax from the center frequency. Taking in account the overlap 
between filters, we let AQmax = Aw, which is the difference between 
adjacent center frequencies. During R samples, the phase can change 
by up to AwRT and, hence, with Aw = 27/(NT) we get the condition 
that 27R/N = 7/2; that is, R <= N/4. Therefore, with N = 128 we have 
R s 32, and to save computation we choose R = 32. 

It is noted that this choice of R was dictated by the FDHS modifica- 
tion technique used and not by the analysis-synthesis system. How- 
ever, while a transform of the input data must be taken every 32 
samples for proper sign tracking, the modified STFT can actually be 
computed at a lower rate corresponding to a decimation factor Ry > 
R. This obtains because the analysis-synthesis system is capable of 
acceptable reconstruction for larger values of the decimation factor. 
Furthermore, since g = 1/2 (2:1 compression), the interpolation factor, 
or the output data shift, is R’ = Ry,/2 so that the modified STFT needs 
to be computed only every N = 128 samples. This corresponds to 
R’ = N/2 = 64 which, as shown earlier, can be accommodated by the 
analysis-synthesis system with an acceptable reconstruction error. 

In summary, the following windows and parameters are used. The 
transform size is N = 128; the analysis window is h(n) = ho(n) of Fig. 
10 (L, = 512); the input data shift is R = N/4 = 32; the sTFT 
modification is computed every Ru = N = 128 samples; the output 
data shift is R’ = Ry,/2 = N/2 = 64; and the synthesis window chosen 
is f(n) = wm(n) of eq. (35), with Ly = 2N = 256, and L = R’ = 64. This 
selection of parameters and window functions is supported by simu- 
lation results with both synthetic and natural speech signals as will be 
illustrated subsequently. 

Before we present the design considerations for the expansion sys- 
tem, we wish to explain an additional issue related to the selection of 
the synthesis window and give details of the strr modification. In Fig. 
2, the synthesis filters were assumed to have a bandwidth which is q 
times the bandwidth of the analysis filters. In the implementation 
described above, this corresponds to using synthesis filters with L = N 
and not L = R’ as we have chosen, which widens the synthesis filter 
bandwidth since R’ < N. However, with practical synthesis filters the 
use of L = N was found to cause inband attenuation and an increase 
in the reconstruction error. For R’ = N/2 = 64, the use of the above- 
selected window functions, wy(n) for Ly = 256 or wys(n) for Ls = 512 
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with L = R’ gave better or as good results as several other window 
functions having bandwidth in the normalized frequency range of 
a/R’ to 7/N. However, for smaller values of R’ an advantage was 
found in using L = N/2 > R’ because of the additional filtering 
provided by the synthesis window. 

We discuss now the details of the strT modification for frequency 
compression. The modification is performed in each band according to 
the expressions in eq. (17) at time instants t = nRuT, where Ry is the 
modification decimation factor, chosen to be Ry = N = 128. The sign 
tracking is done according to eq. (18) at a higher rate, namely at time 
instants t = nRT, with R = N/4 = 82. The condition for sign initiali- 
zation in each band is determined by eq. (25). The ratio on the left- 
hand side of eq. (25) is computed at time instants t = ¢, = nTo, where 
To = NT, which is also the STFT modification update interval since Ry 
= N. If the ratio exceeds the threshold value E7 in a given band, the 
signs in that band are initialized. On the basis of simulations, Er was 
chosen to correspond to an energy increase of 10 dB during the time 
interval JT) = 16 ms for 8-kHz sampling rate. This choice of Ev was 
found to indicate quite reliably the onsets of speech, the transitions 
from unvoiced to voiced, and the crossing of a pitch harmonic into a 
given band from a neighboring band. At the same time, this value was 
found to be sufficiently high to prevent reinitialization because of the 
normal amplitude fluctuations in each band during sustained voiced 
intervals. 

If the initialization condition is met, the initialization process con- 
sists, on the basis of the discussion in Section 3.1, of the following 
operations. First, the signs in all the cells, or bands, which need to be 
initialized are set according to eq. (19); that is, according to the 
principal value of the phase in each cell. Then, starting from the lowest 
frequency cell to be initialized, a “pairing” process is performed. That 
is, for each cell to be initialized one of its adjacent cells is picked for 
sign matching to provide for the situation in which a pitch harmonic 
is shared by two adjacent bands. This is done either by picking the 
band which gives an R(t,) [see eq. (20)] with a smaller phase angle 
(see Section 3.1) or, as preferred in our simulations and to save 
computation, by picking the band in which the signal magnitude is 
larger. Note that if cell o is to be initialized and cell ko + 1 is picked 
for sign matching, then, even if cell ky + 1 is also to be initialized, this 
cell is skipped in the-pairing and matching process since it was already 
chosen to be matched to cell po. Finally, the sign matching of the 
chosen pairs is performed by computing Sz for each pair, using eqs.(23) 
and (24), and inverting the signs in one of the bands of those pairs for 
which Sz is found to have a negative value. 


2140 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


5.2 Expansion system design 


Expansion by an integer factor avoids the phase ambiguity problems 
and is, therefore, a relatively easy task. The main requirement is 
sufficient frequency resolution in the spectrum analysis. Since there is 
no concern if the filters overlap, the resolution requirement is easily 
fulfilled by using even the simple Hanning window h(n) = [1 + cos(an/ 
N)]/2, nE[—N/2, N/2], with L;, = N = 256. The Hanning window also 
satisfies eq. (29) and, therefore, the resulting filter bank is uniform. 
Without the stFT modification, the analysis-synthesis system can now 
be made a unity system—no reconstruction error—since eq. (31) can 
be exactly satisfied. In particular, if f(7) is a rectangular window of 
length N, eq. (31) becomes 

Lr-1 
yy A(sR +n) =1 n=0,1,---,R-1, (39) 
s=—Lp 
where Lr = N/R, with R being assumed again to divide N. The 
condition in eq. (39) is satisfied within a constant gain factor by the 
Hanning window for any R = N/2 which divides N. The gain factor is 
Lr/2. The analysis-synthesis implementation then becomes the well- 
known overlap-add (OLA) implementation.'*””? 

Let us now consider the effect of stFT modification for frequency 
expansion on the analysis-synthesis system design. Since in this case 
q = 2, so that R’ = 2R, limiting R’ to N/2 requires limiting FR to N/4. 
However, since the rectangular synthesis window has high sidelobes in 
its frequency response, it did not provide sufficiently good speech 
quality when frequency expansion was performed. To improve the 
filtering provided by f(n), we have considered again using the window 
functions in eqs. (34) and (35). Very good quality expanded speech was 
obtained with R = N/8 = 32 and wm(n) of eq. (35), using L; = N = 256 
and L = N/2 = 128. For this choice of parameters, wy(n) becomes 
identical to the Hanning window wz(n). Note that the output data 
shift or interpolation factor is R’ = 2R = N/4, but L = N/2,i.e., L > 
R’, which is a condition that provides for additional filtering as dis- 
cussed in Section 5.1. 

To summarize, the expansion system is implemented with a trans- 
form size of N = 256 and analysis and synthesis windows that are both 
Hanning windows of length N. The input and output data shifts are 
given by R = N/8 = 32 and R’ = 2R = 64, respectively, and the STFT 
modification is done according to eq. (27) at time instants t = nRT, 
ie., every R = 32 samples. 


5.3 Simulations 


The FDHS system was simulated on a laboratory computer which is 
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equipped with an integral array processor (Data General-Eclipse AP/ 
130). The array processor facilitated fast computation of the needed 
array operations, such as transforms, windowing, stack-adding, and 
overlap-adding. The input signal was generally telephone bandwidth 
speech (200-3200 Hz) sampled at 8 kHz. In addition to natural speech 
input, a synthetic vowel with fixed pitch was used. It was synthesized 
by periodically repeating a single pitch period (51 samples) from the 
speech of a male speaker. This synthetic signal was valuable in the 
development of the system, in checking assumptions, and in selecting 
parameters and window functions. By way of illustration, and to 
support our selection of parameters, Fig. 13 shows in part (a) the 
spectrum of the synthetic vowel, and in parts (b) to (1) the spectra 
obtained for different windows and parameter values, as detailed in 
the figure caption and explained below. For reference, an ideally 
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Fig. 13—Spectral representation of original and processed synthetic vowel for differ- 
ent system parameters and window functions. (a) Original. (b) Ideal 2:1 compression. (c) 
Compression with the selected system parameters: N = 128, h(n) = ho(n), R = 32 
Ru = 128, R’ = 64, f(n) = wm(n) with L; = 256, L = 64. (d) Same as in (c), except 
Ru = R = 32. (e) Same as in (c), except Ru = R = 64. (f) Same as in (c) but f(n) = 
wus(n) with L; = 512, L = 64. (g) Same as in (c) but f(7) = wy(n) with L = 64 (L; = 128). 
(h) Same as in (c) but f(n) = wr(n) with L = 64 (Ly; = 128). 
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Fig. 13—(Contd.) (i) Expansion of the ideally compressed signal in (b) using the 
selected expansion system parameters: h(n) = wu(n) with L = 128, (Ly; = N = 256), 
R = 82, R’ = 64, f(n) = wa(n) with L = 128, (Ly = 256). (j) Same as in (i) but with 
f(n) = wr(n) (L = 128) and R = 64. (k) Same as (1) except R = 64. (1) Expansion of the 
compressed signal in (c) using the selected system parameters as in (i). 


compressed spectrum is shown in part (b). Since the synthetic signal 
is periodic with a known period, ideal 2:1 compression can be obtained 
by discarding every other period and reducing the sampling rate by a 
factor of 2. Practically, (b) is obtained by windowing (we used a 
Hamming window for the spectral analysis) a segment of speech with 
half the number of samples than in the segment analyzed for producing 
the spectra in (a). Since the input signal is exactly periodic, the shape 
of the spectral teeth is determined only by the frequency response of 
the window. In natural speech, which is only quasiperiodic, the ampli- 
tude and phase modulations of the pitch harmonics, as discussed in 
Section II, widen further the spectral teeth. This can be observed in 
later illustrations. 

In (c) of Fig. 13 the spectrum of the compressed signal, using the 
selected system parameters detailed in Section 5.1, is shown. Because 
the signal is purely periodic, the results are particularly good. A 
frequency domain signal-to-distortion ratio (SDR) computation* results 


* The frequency-domain distortion measure used is based on the mse between the 
spectral envelopes of the input and processed signals, as measured in sub-bands with a 
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here in SDR = 42 dB. To illustrate that no harm is done by using 
Ru > R (see Section 5.1), part (d) shows, for comparison with (c), the 
result of using Ry = R = 32. The difference is minute—only 0.1 dB 
higher spR between the processed and ideal spectrum. Part (e) shows 
the spectrum obtained when Rk = Ry = 64 is used. The problem in this 
case is that for some harmonics the sign tracking is inadequate because 
R is too large. The effect on the spectra is clearly seen. Part (f) shows 
the result obtained when a longer, L; = 512, synthesis window is used. 
On the basis of Fig. 12, wus(n) is now preferred and used. The results 
are somewhat better than in (c), an improvement of 2 dB in spr, but 
this does not seem to justify the doubling of the output buffer size and 
the additional computation. Parts (g) and (h) reaffirm the unsuitability 
of the simple Hanning and rectangular windows, respectively, as syn- 
thesis windows in the given compression system. The loss in SDR 
amounts to 18 and 28 dB, respectively, as compared to (c). 

The remaining parts of Fig. 13 illustrate some of the results obtained 
with the expansion system. Part (i) shows the spectrum that results 
from expanding the ideally compressed signal in (b), using the selected 
system parameters as detailed in Section 5.2. As expected, the results 
are better than for compression and the frequency-domain sDR is 
above 60 dB. Parts (j) and (k) show the results obtained with rectan- 
gular and Hanning synthesis windows, respectively, using R = 64. It is 
clearly seen that this value of R is too large and its use results in 
significant spR reductions for both windows—up to 35-dB reduction 
for the Hanning window, as compared to (i). For R = 32, the selected 
Hanning window performs best, as seen in (i), offering an 8-dB advan- 
tage in SDR over the rectangular window. 

Finally, part (1) shows the resulting spectrum when the compressed 
signal in (c) is expanded back, using the selected system parameters as 
in (i). The reduction in spR because of the expansion process was 
found to be only about 0.5 dB; that is, a 41.5-dB spDR is obtained when 
the spectrum in (1) is compared to the spectrum in (a). 

The importance of correct sign initialization is demonstrated in Fig. 
14. In addition to showing the ideally compressed spectrum of the 
synthetic vowel in (a)—identical to part (b) of Fig. 13—it also shows 
the spectrum of the frequency compressed signal obtained with three 
different sign initialization approaches. Part (b) of Fig. 14 shows the 
result obtained when all the signs are initialized to be positive; part 
(c), when the signs are initialized according to eq. (19) (i.e., using only 


bandwidth that increases with frequency, similar to articulation index measurements. 
The actual band allocation used was taken from Ref. 31. This measure was found useful 
for the synthetic signal, as it correlated well with our observations. For natural speech, 
however, this was not so and we had to rely on subjective listening and visual exami- 
nation of the spectral representations. 
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Fig. 14—Demonstration of the effect of different sign initialization approaches. (a) 
Spectrum of the ideally compressed synthetic vowel. (b) All-positive sign initialization. - 
(c) Sign initialization according to principal value of the phase in each band. (d) Sign 
initialization according to the algorithm developed in Section 3.1. 


the principal value of the phase); and part (d), according to the final 
initialization algorithm which includes the pairing and matching op- 
erations described earlier and also shown in (c) of Fig. 13. The results 
indeed validate the assumptions and analysis performed in Section 
I1i—namely, the possible generation of undesired spectral components, 
in addition to the attenuation of the desired components. Part (b) 
shows the result when a random or all-positive sign initialization is 
used; part (c), the possible cancellation of harmonics because of incor- 
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rect matching of the signs in two adjacent bands which share the same 
harmonic; and part (d), the improved results obtained by using the 
developed sign initialization algorithm. 

Similar results were obtained with natural speech, although errors 
in the initialization do occur due to speech nonstationarity and devia- 
tions from the harmonic model. Yet, in all the simulations performed, 
the developed sign initialization algorithm always resulted in better 
speech quality. As an illustration, Fig. 15 shows the spectra obtained 
for a segment of voiced speech. Parts (a) to (c) show the spectra of the 
original, compressed, and reconstructed signal using the developed 
FDHS system. For comparison, parts (d) and (e) show the corresponding 
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Fig. 15—Spectral representation of original and processed voiced speech segment for 
male speaker. (a) Original. (b) Compressed 2:1 using FDHS. (c) Expansion of compressed 
signal (reconstructed) using FDHS. (d), (e) Same as (b) and (c), respectively, but using 
TDHS. 
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results obtained with the time-domain harmonic scaling (TDHS) tech- 
nique‘ using a cepstral pitch detector®’ implemented on the same array 
processor. The TDHS system was judged to give higher quality speech. 
Further discussion on the comparison between the two systems is 
given later. For an additional demonstration of the results obtained by 
the two systems, Fig. 16 shows the corresponding time waveforms, and 
Fig. 17 presents spectrograms of the complete processed sentence. To 
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Fig. 16—Time-domain waveforms corresponding to the spectral representations given 
in Fig. 15. 
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FREQUENCY IN KILOHERTZ 


illustrate the capability of the two systems to change the time scale, 
without changing the frequency scale, parts (f) and (g) of Fig. 17 show 
spectrograms of the time-compressed signals. 


5.3.1 Performance with degraded inputs 


To examine the robustness of the FDHS technique to adverse acoust- 
ical environment conditions, we ran simulations with noisy speech 
signals (down to 0-dB s/n), speech with severe room reverberation, 
and speech from three speakers speaking simultaneously. In simula- 
tions with noisy speech, the FDHS system appeared to be quite robust 
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Fig. 17—Spectrograms of original and processed speech by FDHS and TDHS systems 
(‘We were away a year ago,” male speaker.) (a) Original. (b) Frequency compressed— 
FDHS. (c) Frequency-expanded (reconstructed)—FpDHs. (d) Frequency-compressed— 
TDHS. (e) Frequency-expanded (reconstructed)—TDHs. (f) Time-compressed—FDHs. (g) 
Time-compressed—TDHS. 
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Fig. 18—Spectral representation of original and processed noisy voiced speech seg- 
ment [white noise at 6-dB s/n added to signal in (a) of Fig. 16]. (a) Original. (b) 
Reconstructed—FDHS. (c) Reconstructed—TDHS. 


with only some reduction in signal crispness as compared to processing 
clean speech. No change in the general nature of the noise was 
observed. This is in contrast to results obtained with the TDHs tech- 
nique which tends to structure the noise caused by the pitch-depend- 
ent, time-domain weighting process. This is illustrated in Fig. 18 which 
shows the spectra of the original noisy signal (white noise at 6-dB s/n) 
and the reconstructed signals by the FDHS and TDHS systems. The 
structuring of the spectra, according to the speech pitch, caused by 
the TDHS technique is evident. Although this structuring provides a 
filtering effect similar to comb filtering, the structured noise is usually 
more annoying to listen to. For further illustration, Fig. 19 presents 
spectrograms of the complete processed noisy sentence by the two 
systems. 

The results of processing speech with severe room reverberation, 
and multiple speakers speech, indicate that the FDHS system is highly 
robust to these conditions. In fact, these conditions tend to mask 
processing artifacts and the reconstructed signal sounds very similar 
to the original. In the TDHS system, the structuring of the uncorrelated 
reverberation components is perceivable. On the other hand, the TDHS 
system was found to be quite robust to multiple speakers speech.° 
Spectrograms of a processed sentence, recorded with severe room 
reverberation, are shown in Fig. 20, for the two systems. 
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To summarize, by the above simulation results we have demon- _ 
strated the validity of our assumptions in deriving the sign initialization 
algorithm and the selection of the analysis-synthesis window functions 
and parameters. The FDHS system was found to be robust to environ- 
ment conditions, although its quality for clean speech signals is judged 
to be lower than with the TDHS. On the other hand, the robustness of 
the TDHS system is largely dependent on the type of pitch detector 
used (particularly with noisy signals), and it suffers from artifacts 
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Fig. 19—Spectrograms of original and processed noisy speech by FDHS and TDHS 


systems. Parts (a) through (e) as in Fig. 17, but with white noise at 6-dB s/n added to 
original speech signal. 
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Fig. 20—Spectrogram of processed speech with severe room reverberation by FDHS 
and TDHS systems [““The birch canoe slid on the smooth (plank),” male speaker]. (a) 
Input signal. (b) Compressed—FDHS. (c) Reconstructed—FpHS. (d) Compressed—TDHS. 
(e) Reconstructed—TDHS. 


introduced by structuring wide-band noise or uncorrelated reverbera- 
tion. 

It should be mentioned, however, that while the FDHs system, in 
general, provided good communications speech quality, an increase in 
reconstructed signal degradation was found for some low-pitch male 
speakers, typically below 80 Hz. One possible reason is insufficient 
frequency resolution of the analysis filter bank. At the expense of a 
slight input bandwidth reduction, this condition can be corrected 


FREQUENCY SCALING 2151 


somewhat by reducing the sampling rate from 8 kHz to 6.4 kHz since 
increasing N from 128 to 256 is not desirable as explained earlier. 
Another source of degradation appears to be the fact that if the pitch 
teeth are wide, and the pitch frequency is low, the original speech is 
characterized by a small separation of the pitch teeth, and the model 
assumed in the derivation of the FDHs technique is not as applicable. 
Further study of this problem is needed. 


Vi. A HYBRID. TDHS-FDHS SYSTEM 


The simulation results presented in Section V and our earlier expe- 
rience with TDHs*~ indicate the advantage of TDHS over FDHS, provided 
that the acoustical environment conditions allow adequate pitch ex- 
traction, the noise structuring is acceptable, and pitch data can be 
transmitted. If pitch data cannot be transmitted, as would be the case 
with analog channels or digital channels with tandeming of waveform 
coders, or if it is not desirable to transmit the data, use of TDHS 
requires reextraction of the pitch at the receiver. Since pitch extraction 
from the compressed signal is more difficult because fewer pitch 
periods per unit time are available, the quality of the reconstructed 
signal is degraded. Since expansion alone with the FDHS system pro- 
vides almost transparent speech quality without the need for explicit 
pitch extraction, we examined the possibility of using a hybrid system, 
such as shown in Fig. 21. In this system the compression is done by 
TDHS and the expansion by FDHS. Simulations using this system 
supported this approach, and the overall speech quality obtained was 
judged to be better than by TDHS without pitch transmission or by 
FDHS alone. For illustration, Fig. 22 shows spectrograms of a processed 
sentence by the different systems. It should also be noted that the 
proposed hybrid system has an advantage with noisy signals as well, 
as long as pitch extraction at the transmitter is feasible. The advantage 
is that the structuring of the noise in the reconstructed signal is 
avoided, since this structuring occurs mainly in the expansion stage of 
the TDHS, which is replaced by FDHs in the hybrid system. Further- 
more, the hybrid system should also be more tolerant to channel errors 
than TDHs alone since these errors appear as noise in the compressed 
signal which does not affect the FDHS expansion system much, but 
could obviously affect the TDHS expansion process. 


Vil. CONCLUSION 


A unified description of several frequency scaling techniques has 
been given in terms of the short-time spectral modifications which 
they produce. This description helps in understanding the properties 
and limitations of these techniques and their relation to FDHS. 
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Fig. 21—Block diagram of hybrid frequency scaling system TDHS-FDHS. (a) Digital 
channel. (b) Analog channel. 


The results of this work, with respect to the FDHS technique,* 
provide a substantial improvement of the earlier version of this tech- 
nique as reported in Ref. 12. The improvement is manifested in both 
the quality achieved, and in implementation efficiency. The improve- 
ment in quality is the result of using a filter bank with higher frequency 
resolution and less overlap between filters, as well as the use of the 
dynamic sign initialization and matching algorithm developed in the 
present work. The improvement in implementation efficiency is 
achieved by the use of a block implementation of the short-time 
Fourier analysis-synthesis system. In this system, the FFT, the embed- 
ded decimation and interpolation, and the WOLA synthesis scheme— 
described in Ref. 14 and extended here to include the case of analysis 
and synthesis window having longer duration than the transform size— 
provide a large saving in computation. 

It was seen that the introduction of stFT modifications can greatly 
affect the characteristics of the analysis-synthesis system and its 
design. The detailed design considerations of the window functions 
and the selection of the decimation and interpolation factors given 
here can be useful also in other applications involving spectral modi- 


* An early presentation of the results was given in a talk that included an audio tape 
demonstration.” 
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Fig. 22—Spectrograms of original and processed speech by FDHS, TDHS, and hybrid 
TDHS-FDHS systems [“This is a computer test (of a digital speech coder),” male speaker. ] 
(a) Original. (b) Reconstructed—Fpus. (c) Reconstructed—Tpus. (d) Reconstructed— 
TDHS with pitch reextraction from compressed signal. (e) Reconstructed—hybrid TDHS- 
FDHS system. 


fications of signals, such as in some of the techniques described in 
Section II. 

The developed FDHS system is particularly amenable to an array- 
processor implementation. From simulations of the system for a variety 
of adverse acoustical environment conditions, the FDHs technique 
appears to be robust and provides reconstructed speech of good com- 
munications quality. The main degradation, as mentioned earlier, is 
introduced in the compression stage, since the expansion operation 
provides almost transparent quality. Unlike the simpler TDHs tech- 
nique, the FDHs is not explicitly dependent on pitch extraction and is, 
hence, more robust. However, for clean speech, compression with TDHS 
results in better speech quality than with FpHs. On the other hand, for 
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noisy speech signals, in addition to possible failure of the pitch detector 
at high-noise levels, the TDHS expansion process tends to structure the 
noise which can be perceptually annoying. 

In applications where pitch extraction at the transmitter is feasible 
but where pitch data transmission is to be avoided, the hybrid TDHS- 
FDHS system, in which compression is performed by TDHS and expan- 
sion by FDHS, provides better overall speech quality than the TDHs or 
FDHS systems alone. The additional advantages of the hybrid system, 
such as reduction of noise structuring and higher immunity to channel 
errors, as compared to TDHS alone, and the lower complexity, as well 
as higher quality, as compared to FDHS alone, singles out the hybrid 
system as the best solution for a variety of applications. 

An interesting outcome of the general implementation scheme, 
shown in Fig. 8, is the generalization of the TDHs technique to include 
both analysis and synthesis windows. This generalization has potential 
for further improving the TDHS performance. 

The FDHS technique was developed on the basis of the quasihar- 
monic nature of voiced speech. Deviations from this model cause a 
reduction in processed speech quality. To achieve higher than com- 
munications quality with the FDHS technique, further study and un- 
understanding are needed of the simultaneous amplitude and phase 
modulation processes of the individual pitch harmonics, and of the 
nonstationary characteristics of speech signals. 
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McDonald’s Problem—An Example of Using 
Dijkstra’s Programming Method 


By D. K. SHARMA 
(Manuscript received April 20, 1981) 


We use Dykstra’s programming method to solve the so-called 
McDonald’s problem and show how to rigorously introduce file input/ 
output operations in the program. The steps involved are quite simple 
and the paradigm suggested is applicable to the wider class of 
problems that involve sequentially processing file records. For 
these problems, the programs developed using the data structure 
design methodology are generally considered to be the most desir- 
able. We show that Dijkstra’s method can yield the same program. 
Unlike other methodologies, it also yields a correctness proof, 
which is extremely valuable in understanding the program and in 
modifying it. 


l. INTRODUCTION 


In this paper, we use Dijkstra’s method of simultaneously developing 
a program and a proof of its correctness to solve the so-called Mc- 
Donald’s warehouse problem. The problem briefly is to read a card file 
and print an inventory report. Our interest in it stems from the fact 
that it requires sequentially processing the records of a file—a task 
common to a large class of problems. As we solve the problem, we 
illustrate how to rigorously introduce file input/output operations 
within the framework of Dijkstra’s method. This aspect of the solution 
is intended to be a paradigm that is applicable to the above-mentioned 
' class of problems. 

Our solution serves one other purpose. The McDonald’s warehouse 
problem is often used to compare the effectiveness of different pro- 
gramming methodologies in developing programs that sequentially 
process files. As discussed in Ref. 1, the programs developed using the 
data structure design methodology are generally considered to be the 
most desirable. This is primarily because the structure of the resulting 
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program closely reflects the structure of the input data. The program 
developed in this paper is identical to that developed by the data 
structure design methodology, except that the latter has no correctness 
proof associated with it. The proof-related assertions in a program are 
not only helpful in understanding it, but also in systematically modify- 
ing it. 

The solution discussed below is quite simple and regular—as a 
paradigm it can be systematically applied to other similar problems. It 
is obtained in three steps. In Step 1, we develop the program and its 
proof assuming that the records of the file are available in an array. 
This version of the program also uses a few symbols that are related 
to its proof, assuming that their values are readily available. These 
assumptions are made purely for the sake of convenience in developing 
the program and are removed in the next two steps. In Step 2, we (7) 
introduce additional program variables, (iz) modify the assertions to 
reflect the introduction of the new program variables, and (iii) add 
appropriate statements that make the assertions hold. Thus, we are 
guaranteed that the program remains correct through all the modifi- 
cations. This is done to eliminate from the program text the symbols 
whose values are not readily available. In Step 3, we introduce the file 
operations. This is done by replacing suitably chosen initialization 
statements by openfile and by replacing certain other groups of assign- 
ment statements, by readfile or writefile. 

Section II presents the problem, first informally and then formally. 
This is followed by the three steps of the solution in Section III and by 
a summary in Section IV. 


Il. PROBLEM SPECIFICATION 


McDonald’s food warehouse receives and distributes food items. 
Each shipment received or distributed is recorded on a punched card 
that contains the name of the item and the change in the quantity of 
the item due to that shipment. The change is recorded as a positive 
integer when items are received and negative otherwise. These cards 
are alphabetically sorted according to item names by another program. 

The problem is to write a program to read the sorted card file and 
print an inventory report. The report should show the net change in 
the inventory of each item transacted and the number of distinct food 
items transacted. At this point, the reader may wish to devise his or 
her own solution and later compare it with the solution derived below. 

The following is a more formal statement of the problem, which is 
used in developing the solution in Section III. 

Assume that the transaction file contains M — 1 cards, and the ith 
card contains two fields, f(z) and g(t), where 1 =i = M — 1, f(t) isa 
positive integer representing the name of the food item on the ith card, 
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and q(t) is an integer representing the change in the quantity of f(z). 
Note that, without any loss of generality, we have assumed f(z)’s to be 
integers instead of identifiers. 

The file has been sorted in the nondescending order of f(z)’s. For the 
sake of discussion, we augment the file with an extra card signifying 
“end of file’ (EOF) condition for which f(M) = EOF and q(M) = 0. 
The value EOF is assumed to be greater than all the other /(z)’s to 
maintain the sorted nature of the file. 

Clearly, all the cards of a particular food item are grouped together 
in the file. Let N be the number of distinct food items, that is, the 
number of groups in the augmented file. Let m, be the index of the 
first card of the nth group, where 1 <n S N. That is, 


m =1 
and m, = min{t:m-1<isM and f(i—1)<f(i)} forl<n=N. 
The m,’s have the following property 
1=m <me+-++<my-1<mn=M. 


Define p, to be the net change in the quantity of the nth food item, 
where 1 = n < N. We do not define py, which corresponds to the 
fictitious card used to augment the file. We can express p, as 

P= YX qi) forlsn<N. 
Note that p,’s are the values to be printed in the report. 

We can now define the goal of the program as: Print N — 1 lines 
such that the nth line contains the name of the nth food item [(i.e., 
f(m,)] and the net change in the quantity of the food item, i.e., Dn. 
Then print a line containing N — 1, the number of food items trans- 
acted. Note that N is the number of groups in the augmented file. 


ill. SOLUTION 


In the following, we first develop a program to print the first N — 1 
lines of the report and add the last line later. 
We assume that the reader has at least a cursory knowledge of the 
methodology described in Refs. 1 and 2. See Ref. 3 for a brief tutorial. 
We develop the iterative solution in three steps. In Section 3.1, we 
assume that N is known, and the following are available as arrays: 
f(t) and q(t) for 11M, and 
m for 1sisQN. 


In Section 3.2, we remove the above assumptions one by one, as 
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described in Section J. This is done by introducing new program 
variables, etc., while still maintaining program correctness. This cul- 
minates in a program in which N need not be known, and only one 
element each from the arrays f(z) and qg(i) appears in the loop. 

In Section 3.3, we identify statements that can be replaced by file 
operations. This substitution is quite mechanical and yields a program 
that sequentially reads the card file and prints the first N — 1 lines of 
the report. The post-assertion of this program is then used to print the 
last line of the report. 


3.1 Step 1 


In the notation of Ref. 1, the result assertion is given by 
RI: (Ai: 1<1< WN: p; has been printed). 


This should be read as follows: for all i such that 1 =i < N, p; has been 
printed. 

The iterative statement will be the main part of the program. Its 
loop invariant P1 is obtained by weakening the result assertion R1, 
that is, replacing the constant N by a variable n. Thus, we have 


P1:(At:1si<nsN: p; has been printed) 
and (PlAn=N)=RI1. 


The first version of the program is as below. 
Solution 1.1 


n:= 1; {P1} 
don#~N—- 

Increase n under the invariance of P1. 
od {PlAn=N}. 


This program begins with an initialization step that trivially establishes 
P1. The loop increases n and keeps P1 invariant; therefore, at the end 
we can assert P1 A n = N, which is equivalent to F1. 

To show termination, choose the termination function 


t=N-n. 


The value of ¢ is initially N — 1, each iteration of the loop reduces it, 
and ¢t = 0. The loop, therefore, must terminate. 
We now add a statement to increment n: 


n := 1; {P1} 
don #N—> 
S1; {Q} 
ni=n+1 {P1} 
od {PlAn=N}. 


2160 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


Here, Q is the “weakest precondition such that the execution of ‘n := 
n+ 1 will establish P1.” It is obtained by replacing all occurrances of 
nin P1 by n + 1, and the resulting expression is denoted by P1|i*’. 
Thus, 


Q = up(“‘n:=n+4+ 1”, Pl) = Pl, 
where wp(S,P) is the weakest precondition in which execution of S 


will establish P. 


The statement S1, starting execution in state P1, must establish 
P1|z*". This can be simply done by computing the value of p, and 
printing it. Thus, S1 can be refined as: 
S1: {P1} 
S2; {sum = pn A P1} 
print (sum) {P1|7*"}, 


where the program variable sum has been introduced. 
We now refine S2. It has the property 
{P1} S2 {R2 and P1}, 
where 


R2:sum=pr= YY q(t). 


m,Sl<mM, +1 


Statement S2 will be an iterative program, and to get its loop invariant 
P2, replace the constant m,41 by a variable m. Thus, 


P2: sum = y q(t) 


and (P2A m= mn) = R2. 
Using the same technique as before, S2 is refined as 
S2:m := mn; sum := 0;{P2} 
do m ¥ mn+1—> 
S38; {P2| mn" 
m := m + 1{P2} 
od {P2 A m = mati}. 


Notice that the initializations establish P2 in the beginning and the 
post-assertion is equivalent to R2. Statement S3 must have the prop- 
erty 


{P2} S83 {P2|z"5 


and can be easily shown to be sum := sum + q(m). 
This gives us Solution 1.2, after assembling all the pieces. 
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Solution 1.2 


n:= 1;{P1} 
don # N> 
m:= m,; sum := 0; {P2 Am =m,} 
dom A Mn+1—2> 
sum := sum + q(m); 
m:= m+ 1{P2} 
od; {P2 Am = myy3} 
print (sum); 
n:= n+1{P1Am=m,} 
od {(PlIAm=m,An=N}. 


We have added “‘m = m,,”” to the assertions where it happens to hold. 
This is indicated in bold and is used in the next section. 


3.2 Step 2 


Solution 1.2 is unsatisfactory since it explicitly uses N, m,, and mni1, 
which are not available a priori. We modify it in the following to 
eliminate this deficiency. 

These modifications are done by (i) introducing additional program 
variables, (ii) slightly modifying the assertions to reflect the introduc- 
tion of new variables, and (zi) adding appropriate statements to make 
the assertions hold. Thus, the modified program is guaranteed to be 
correct. We believe that this technique is applicable not just to this 
problem but also to the wider class of problems wherein files are 
processed sequentially or where the initial versions of the programs 
refer to quantities not readily available. 

We make the above-mentioned three changes as follows. In the first 
change, we eliminate the assignment statement m := m,. Note that the 
rest of the outer loop maintains m = m, invariant; so we could add this 
term to the loop invariant P1 and eliminate the assignment statement. 
An extra initialization statement, m := 1, would then become necessary 
to establish the new loop-invariant in the beginning. The program is 
still correct. See Solution 2.1 below. 

In the second change, we modify the outer loop guard n ¥ N. Since 


(n # N) =[f(m,) 4 f(my)], 
my = M, and 
m=mMn 


hold before the outerloop, its guard can be replaced by f(m) ¥ f(M). 
This change does not affect any of the assertions. 

In the third change, we modify the inner loop guard, m ¥ mni1. For 
Mm, =m <My+1; 
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(m # Mn+) = [f(m) = f (mn)] . 


Therefore, we can use f(m) = f(m,) as the guard in place of m ¥ mn+1, 
without disturbing anything else. We now replace f(m,) by a program 
variable F’; the guard becomes f(m) = F. The meaning of the inner 
loop is kept unchanged by requiring F = f(m,) to be true before the 
inner loop. This requirement is trivially met by adding the assignment 
F':= f[(m) at that point; notice that m = mz, is true before the loop, as 
discussed above. 

The above additions appear in bold print in the following program. 

Solution 2.1 


n:= lm: 1;{Pl Am=m,} 
do f(m) + f(M)—> 
sum := 0; F := f(m); {P2 A m =m, A F = f(m,)} 
do f(m) = F—> 
sum := sum + q(m);m:= m + 1{P2} 
od; (P2 Am = mani} 
print (sum); 
n:= n+1{P1Am=m,} 
od {Pl1Am=m,An=N}. 


This program, still correct, does not explicitely use N or m,; notice 
that f(M) is the special value EOF. They are, however, an integral 
part of the proof and the assertions. The two assignment statements 
involving n could be removed from the program without affecting it. 
But we retain them, as the final value of n is of interest in printing the 
Nth line of the report. 


3.3 Step 3 


Solution 2.1 uses f(m) and q(m) as if they are available as arrays, 
but they are not. We eliminate their use in two steps and introduce file 
operations instead. This technique is useful not only in solving Mc- 
Donald’s problem, but in a wider class of problems that involve 
sequential file processing. . 

In Step 1, we replace f(m) and g(m) by the variables f and g, 
respectively. This necessitates asserting f= f(m) and g = q(m) before 
the statements where the substitution is made. Just as in the previous 
section, the assertions are made to hold by introducing appropriate 
assignment statements. 

The two above assertions can become false only after a statement 
that modifies m. Therefore, we introduce the assignment statements 
f:= f(m) and q := q(m) after each statement that modifies m. Solution 
2.1 has only two such instances. The program after these modifications 
is as follows. 
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Solution 3.1 


n:=1;m = 0; 
m:=m+1;f = f(m);q:=q(m); {P2 Am=m, AA} 


do f  f(M)—> 
sum := 0; F := f; {P2 Am =m, A F = f(m,) A A} 
do f = F> 


sum := sum + q; 

m:=m +1; 

f = f(m); 

q = q(m){P2 A A} 
od; {P2 Am=m, A A} 
print (sum); 
n:=n+1{P1Am=m,A A} 

od{P1 Am=m, An=N}, 


where A is f = f(m) A gq = q(m), and the initialization of m has been 
broken up into two statements in the beginning of the program. 

We now introduce the file operations in this program: replace m := 
0 by openfile; m =n + 1, f = f(m); g = q(m) by read(f, g); and f(M) 
by EOF. Also, n equals N at the end of the program, so we can add the 
statement to print n — 1, the number of items transacted. The resulting 
program, without the assertions, is as follows. 

Solution 3.2 


n := 1; openfile; read(f, q); 
do f # EOF—> 
sum := 0; F = f; 
do f = F> 
sum := sum + q; 
read(f, q) 
od; 
print (sum); 
ni=n+1 
od; 
print (n — 1). 


This solution is identical to that obtained by the data structure 
design technique. It is considered to be a desired solution: its structure 
reflects the problem closely, it does not treat any card specially, it 
neatly handles all the groups one after the other, and it can be modified 
to add special processing at the beginning or at the end of a group.” 


lV. SUMMARY 


In this paper, we solved the McDonald’s warehouse problem to show 
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how to effectively use Dijkstra’s method to develop programs that 
process records in a file sequentially. 

The solution was developed in three steps. We first assumed that 
the records of the file were available in an array, and developed the 
program disregarding the file operations. This program also used 
certain symbols whose values are not readily available. These symbols 
were removed in the next step by introducing additional program 
variables and modifying the program under correctness assertions so 
that the next step could be carried out. Finally, we replaced one 
initialization statement by openfile, and selected groups of statements 
by readfile or writefile. The example discussed in this paper involved 
only reading a file, but the same techniques apply when a file is 
written, too. 

With a proper choice of invariants, the programs thus developed are 
comparable to those obtained by the data structure design, a technique 
that is considered to yield good programs for such problems. 
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of Time-Compressed TV Signals in a Time- 
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Time-compression multiplexing (TCM) has recently been proposed 
for application in multiple TV transmissions through satellites. It is 
advantageous over frequency-division multiplexing because of its 
relative immunity to nonlinear transponder effects. Here we study 
two important and fundamental aspects of TcM—the spectral prop- 
erties and band-limiting effects on the time-compressed signal. We 
derive the output spectrum of a time-compressed signal and show 
that if the original input signal is assumed to be: (i) band-limited to 
B Hz, (ti) segmented into T-second intervals before time compression 
by a factor of a(a = 1), and (iii) 1/T « B, then essentially all the 
spectral power in the output time-compressed signal is contained in 
the bandwidth |f| <= aB Hz. This result is applicable to the Tv case. 
Numerical examples on various types of spectra are also presented. 
Using the TV example, we further demonstrate that the ripples created 
by low-pass filtering the time-compressed signal up to aB Hz are 
small, and interburst interference due to these ripples can be kept 
negligible with a small guard time (about 2 percent of the burst 
duration) between different signal bursts. We also provide a brief 
discussion on some interesting spectral properties of time-compressed 
signals in spectrum-expansion applications. 


|. INTRODUCTION 


Time-compression multiplexing (TcM) is a technique whereby mul- 
tiple signals can be multiplexed together in a common communication 
channel for transmission.’” A simple illustration of this method is 
shown in Fig. 1 where x(t) is a continuous waveform intended for 
transmission. It is first divided into segments of T seconds each; and 
each segment is time compressed by a factor a(a = 1), resulting in a 
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Fig. 1—A simple illustration for TCM. 


bursty signal y(t) with a burst duration of T/a seconds. A total of a 
such time-compressed signals can then be time multiplexed together 
for transmission. In the particular case of TV transmissions through 
satellites, TCM is advantageous over frequency-division multiplexing 
(FDM) because various degrading effects (e.g., intermodulation and 
intelligible crosstalk) due to transponder nonlinearities can be avoided 
by employing tTcm. In a more general context, TcmM is more efficient 
than FDM whenever time division can be accomplished more efficiently 
than frequency division. In this paper, we study two important and 
fundamental aspects of Tcm—the spectral properties and the band- 
limiting effects of the time-compressed signal. 

If we assume that the original signal x(¢) in Fig. 1 is band-limited to 
B Hz, then time compressing it by a factor of a in the infinite time 
duration, i.e., transforming x(t) to x(t/a), would mean a frequency 
spectrum expansion by the same factor (a). However, as shown in the 
diagram, x(t) is segmented into 7-second intervals before time com- 
pression on each segment. Doing so, it is no longer obvious what the 
spectrum should look like or what the bandwidth expansion factor 
would be. It is clear though that the spectral power in y(¢) is nonzero 
beyond aB Hz due to the segmentation; and it is desirable that this 
power beyond aB Hz be small to maintain spectral efficiency in TcM. 
We derive and discuss an explicit expression for the output spectrum 
of the time-compressed signal y(t) (see Section II) and show by 
numerical examples (Section III) that all the significant power is 
contained in the frequency bandwidth below aB Hz, thus, confirming 
the long-speculated result that the bandwidth expansion factor in TCM 
is the same as the time-compression factor. 
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To ensure compliance with the out-of-band emission requirements, 
signals are often filtered before transmission. Such a band-limiting 
operation on the time-compressed signal truncates its small but non- 
zero power beyond its passband (aB Hz) and creates ripples in its time 
waveform. The ripples following the trailing edge of each burst are 
important because they lead to interburst interference in the system. 
We demonstrate using a computer simulation (Section IV) that in the 
specific case of TV transmission, (%) such a band-limiting effect is 
minimal as long as all the spectral components below aB Hz are 
transmitted without distortion and (iz) the interburst interference can 
be kept negligible by introducing a small guard time in the order of 
two percent of the burst duration between different time-compressed 
TV signals. These encouraging results on both the bandwidth expansion 
and band-limiting effects assure us of the basic attractiveness of using 
TCM to transmit TV signals in nonlinear satellite channels. 

It is well-known that time compression can also be used as a means 
to obtain spectrum expansion, e.g., Henry’s spectrum expander used in 
radio astronomy.’ In such a case, the key concern is that of spectrum 
distortion as analyzed thoroughly by Rowe.* We extend our results to 
examine this problem in an appendix, and some simple and interesting 
spectral properties pertinent to the spectrum expansion application 
are discussed. 


ll. SPECTRUM OF THE TIME-COMPRESSED SIGNAL 
2.1 Derivation 


Referring to Fig. 2, let x(t) be an input signal to an ideal time 
compressor which performs the time compression on each T-second 
segment of the input waveform as discussed before. This is mathemat- 
ically equivalent to first time compressing x(t) in the infinite time 
duration, resulting in x.(t), by the required time-compression factor 
a(a = 1), and then time-shifting segments of T/a-second duration in 
x-(t) to various proper time instants to arrive at y(t), the desired time- 
compressed output as shown in the diagram. We are interested in the 
spectrum (or Fourier transform) of y(t), denoted by Y(/). 

By the above definition, y(t) is related to x.(t) by 


yt) = Yo xt — R(T — 2) ]rect.(t — kT), (1) 


k=—-a 


where 


» (a=1) (2) 


Te 


T 
Qa 
and 
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Fig. 2—An illustrative time-compression sequence. 


rect, (t) A 6 = - (3) 
0, otherwise. 
Using the following, 
x(t) <> X-(f), (4) 
rect,(t) <> 7 sinc afr, (5) 


where <> denotes Fourier transform pair, and 
: sin x 
sinc x 4 ——., (6) 
x 


the Fourier transform of y(t) can be written as 


00 


Yify= 3 | Xi(g) exp[—j2agh(T — 7)] 


k=-w J_o 


xX 7 sinc[7(f — g)t] exp[—j27(f— g)kT]dg. (7) 


Assuming that the summation and integration can be interchanged, 
the above becomes 
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¥(f) = | X_(g)r sine[a(f— g)r] 


x exp[—j2ak(—g7 + fT)]dg. (8) 
Applying the well-known identity of 


foe} oO ao k 
Y exp(-j2nkfT) = Y aFT- =z Y (1-3). (9) 
k=—00 k 


=—00 =—00 


Y(f) is simplified as 


Y(f) = | X-(g)t sine[7(f — g)T] 2 6(fT — gr — k) dg 


y x (4 *) sine| a f- C aM 


y x,|2 (/- ) | sine a[ f(r — T) + R]. (10) 


k=—0 


Using (2) and 








1 
Xe(t) = x(at) > X-(f) = +x(4) ; (11) 


a 


where X(f) is the Fourier transform of x(t), we get the final result of 


vify== ¥ x(1-$) sine|o( 7-4 —E) |. (12) 


The above expression relates the output spectrum Y(/) to the input 
spectrum X(/) and is the basic relationship we work with in the rest 
of the paper. The preceding derivation is the simplest that we are 
aware of (see Appendix A and Ref. 5 for comparison). We now proceed 
to discuss some simple properties of Y(f). 


2.2 Simple properties 


We are interested in the properties of Y(/) that convey information 
about the spectral occupancy problem (or the bandwidth expansion 
factor) in TcM systems. In this regard, we must note that Y(/f) derived 
above is just the Fourier transform of one single time-compressed 
signal in the Tcm system. If there are N users for the channel (i.e., N 
time-compressed signals for transmission), then the total signal in the 
transmission channel is (without post-time-compression filtering): 


N 
2) = ¥ wild), (13) 
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where each y; (f) is a time-compressed signal resembling y(t) considered 
previously. The power spectrum of the total signal in the channel is 
2 


N 
P(f) =|Z(f)? = y Yi(f)| , (14) 








where Z(f) and Y;(f) are the Fourier transforms of z(t) and y;(t), 
respectively. It is well-known that 


N 
P(A) = BLYAAP (15) 


only when the y;,(¢) are all uncorrelated. In the particular case of Tv 
transmission, the various time-compressed TV signals are not totally 
uncorrelated because of the presence of sync pulses, color subcarrier 
bursts, and so on. However, from the point of view of spectral occu- 
pancy (i.e., the total power contained in some passband), consideration 
of Y(f) alone is sufficient. In any event, P(f) is calculable from the 
above if it is needed. 

Without loss of generality, we may normalize T = 1, and Y(f) 
becomes 


=—00 


vi== 5 Xi) sine a ¢-£~ 4). (16) 


The simplest property observable from the above is the output dc 
component in Y(f), ie., 


Y(0) =- y X(—k) sinc(zk) 


k=—0 
1 
= — X(0), (17) 
Qa 


which holds for any general X(f) (see Ref. 5 for comparison). 

Let us now examine the bandwidth property of y(t). We assume that 
the input signal is band-limited to B Hz, ie., X(f) is zero for |f| > B. 
With the normalization of T = 1, X(f) is band-limited to M = B/T. 
Dropping the multiplying constant of 1/a for convenience, and at a 
particular frequency f = af, (recall that a is the time-compression 
factor), 


Y(af,) = YY X(afz — k) sine a7[f-(a — 1) — Rk], (18) 
k=~00 

where the sinc function provides weightings for various points in X(/). 

Since the sinc function is maximum when its argument is zero, it is 

sensible to perform the summation starting with the value of k that 

maximizes the sinc function. Denoting such a value of k by ho, it is 
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given by solving 
fe(a -1) —ko = 0. (19) 
The solution is 
ko =fr(a-—1) +e, |e] = 0.5, (20) 


where ¢€ is necessary because Xp is restricted to be integer, and ko is 
unique, except for the case « = +0.5. It should be emphasized that e 
depends on both f, and a. Using this expression for ko, Y(af,) can be 
written as 


Y(afz) = {X(afz — k) sine a[ f(a — 1) — R}}a=2, 
+ ¥ X(af, — k) sine a[ fe(a — 1) — k] 


hho 
= X(fz — €) sinc 7(-e) 
+ ¥ X(f. —€+ K) sinc z(—e + K), (21) 
K#0 


where K in the summation is taken as K = +1, +2, and so on. A 
graphical representation of the above is depicted in Fig. 3. 

To get immediate insight into the bandwidth property, we consider 
the following two cases: 

Case 1: a = integer, f, = integer. Under this assumption, « = 0 and 
we have a simple relationship of 


Y (afx) = X (fx). (22) 


This means that every integer value of f in X(f) is mapped exactly 
onto afin Y(f). Without the normalization on T, this is equivalent to 
saying that every integer multiple of 1/T in X(f) is mapped exactly 
onto a/T in Y(f) as shown in Fig. 4. If the condition that 1/T «< B 
holds, it is almost certain that the spectrum Y(f) is simply the 


x(f 





Fig. 3—Graphical illustration for the summation in the expression for the output 
spectrum of a time-compressed signal. 
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Fig. 4—Frequency-spectrum expansion property through time compression. 


frequency-expanded version of X(f) by the factor a with some insig- 
nificant sidebands for | f| > aB Hz. In the particular case of TcM/Tv, T 
is taken as a scan line duration, yielding 1/T ~ 15.73 kHz. With B = 
4.2 MHz, BT = 267, using a as the bandwidth expansion factor is, 
therefore, a good rule of thumb for the Tv case. We note that the 
condition of a and f, being integers is merely an artifact due to 
normalization. Therefore, we emphasize that this case is indeed more 
general than it appears to be. For instance, for any given set of a, we 
can always find a set of f, such that this condition holds. 

Case 2: a ¥ integer, f, ~ integer. € is generally nonzero here, and 
Y(af,.) is given by a weighted sum of X(f, — € + 2) [see (21) and Fig. 
3] with X(f, — €) as the main contributor. An alternative view is that 
Y(af,) is the weighted average of X(f, — €) and its neighboring points. 
The output spectrum Y(f) is again a frequency-expanded version of 
X(f), except for ripples created by the averaging process. It is noted 
that « can be zero here resulting in Y(af,) = X(f.). This occurs 
whenever f,.(a — 1) is an integer in (20), e.g., a = 3.5, f, = 106.8. 

The foregoing discussion by and large answers the basic question on 
bandwidth expansion in TcM systems. The result of using the time- 
compression factor a as the bandwidth-expansion factor makes sense 
for most cases where the input spectrum X(f) can be modeled as 
continuous and band-limited. The inclusion of peculiar nulls and delta 
functions in X(f) would complicate the matter, but it can be examined 
in detail using the equations provided above. As to the precise shape 
of Y(f) in comparison to X(af), which is relevant in the spectrum 
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expansion application, some interesting discussions are given in Ap- 
pendix B. 


lll. NUMERICAL EXAMPLES OF OUTPUT SPECTRUM 


We present in this section specific numerical examples of output 
spectra calculated from (12). There are four different types of X(f) in 
the examples: 

(t) Rectangular 


_ fl, lA sB, 
A) = {0 otherwise. (23) 


B is normalized to be 1 Hz in the calculation, and T is taken as 267/B 
seconds (the Tv case). Results for five different values of the time- 
compression factor a are plotted in Fig. 5. The peak-to-peak ripples in 
the output passband (|f| = B) are less than 0.5 dB and the sidebands 
are more than 25 dB down in the vicinity beyond the edge of the 
passband and drop off very rapidly. There is no doubt that most of the 
spectral power is contained in the bandwidth | /f| < B. 
(iz) Triangular 
fl 4) 
xe (ing WS, 
0 , otherwise. 


Q@ = TIME-COMPRESSION FACTOR 


RELATIVE AMPLITUDE IN DECIBELS 





NORMALIZED FREQUENCY IN HERTZ 


Fig. 5—Output spectrum of time-compressed signal with a rectangular input spec- 
trum. 
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B is normalized to be 1 Hz, and T = 267/B seconds. The results are 
plotted in Fig. 6. Here the sidebands are so low that they are not 
observable in the diagram. Of course, this is due to the taper-off 
characteristic in X(f). Some small ripples are again present inside the 
bandwidth |/| = aB. 

(111) Half-Cosine 


_, | eal 212), | 
X(f) = 2B) (25) 


0 , otherwise. 


fs B 
j — 


B is again 1 Hz and T = 267/B seconds. The results are plotted in Fig. 
7, and the same observations as in (iz) apply here. 

(tv) Truncated Half-Cosine 

The equation for X(f) is the same as in (liz) above, except that B is 
0.9362 Hz, which corresponds to a 20-dB taper at X(B) as compared to 
X(0). T is again taken as 267 seconds. The results are shown in Fig. 8 
where the glitches at the edge of the output passband (i.e., f= aB) are 
evident. Note that the sidebands outside the passband are much lower 
compared to those in (z) above. 


IV. BAND-LIMITING EFFECTS 


When the original input signal x(t) is band-limited to B Hz, we have 
shown that most of the power in the output time-compressed signal 


RELATIVE AMPLITUDE IN DECIBELS 
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Fig. 6—Output spectrum of time-compressed signal with a triangular input spectrum. 
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& = TIME-COMPRESSION FACTOR 
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Fig. 7—Output spectrum of time-compressed signal with a half-cosine input spectrum. 


a= TIME-COMPRESSION FACTOR 
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Fig. 8—Output spectrum of time-compressed signal with a 20-dB tapered half-cosine 
spectrum. 
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y(t) is confined to |f| = aB Hz. Band-limiting y(t) up to aB Hz 
therefore should hardly affect the fidelity of the signal itself. However, 
such filtering does create ripples following each time-compressed signal 
burst, where a sharp edge occurs in the time waveform. These ripples 
constitute interburst interference and could be a potential source of 
degradation in a TCM system. We demonstrate via a simulation on a TV 
signal in this section that the problem can be alleviated by introducing 
a small guard time (about 2 percent of the burst duration) between 
time-compressed signal bursts from different users. 

In our computer simulation, we generate a TV test signal which is 
similar to the composite test signal in Ref. 6. A scan line (64 us) of this 
is shown in Fig. 9. We first band-limit the test signal by a low-pass (LP) 
filter with zero delay and a raised-cosine amplitude roll-off of 








l-r 
<= fF, +——, 
1 _ {tor f|-hi) a 
A(f) = 5 {1~sin| SU) oe (26) 
l-r 1l+r 
<= —-fy,< 
> Qto < |f| 1 2to > 


0 , otherwise. 


where the parameters F), to and r (F; = 0, t) = 0;0 =r 1) control the 
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Fig. 9A modified composite test signal. 
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0 Fy Fo Fz AF, +1/to 
f 


Fig. 10—A low-pass filter with a raised-cosine roll-off. 


filter shape. Instead of using fp and r to define the filter shape, we use 
the two critical frequencies defined by (Fig. 10): 


1 ; 
Fo 4 Fi, +—; 2 
2=f1 Qt’ (27) 
1l+r 
F; 4 °F, + (28) 





0 


The specific values for F,, Fs, and F; are 4.2 MHz, 4.8 MHz, and 5 
MHz, respectively. 

After the initial band-limiting, we nenoen an ideal time compression 
over each scan line, and the signal is then compressed (with a = 2) 
into time bursts, each of which is 32 us long. We then filter the time- 
compressed signal by the LP filter referred to above with Fi, Fo, and F3 
at 8.4 MHz, 9.6 MHz, and 10 MHz, respectively. The ripples following 
each 32-ys signal burst are observed, and the data are plotted in Fig. 
11 where the ripple magnitude is defined as the ratio of the peak-to- 
peak ripple voltage to the peak-to-peak picture voltage. As seen from 
the figure, a guard time of 0.5 us is sufficient for controlling the 
interburst interference. This amounts to about 2 percent of the burst 
duration. 


V. CONCLUSION 


We have studied two important and fundamental aspects of Tem— 
the spectral properties and band-limiting effects of the time-com- 
pressed signal. We find under the assumptions that (z) the original 
input signal x(t) is band-limited to B Hz, (iz) x(t) is segmented into T- 
second intervals before time compression by a factor of a(a = 1), and 
(ui) 1/T « B, then essentially all the spectral power in the output 
time-compressed waveform is confined to frequencies below aB Hz. 
This result is immediately applicable to the Tv case. Numerical ex- 
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Fig. 11—Ripple magnitude following a time-compressed signal burst in the TV simu- 
lation. 


amples on various types of spectra verify this and show that the 
spectral sidebands beyond aB Hz are very low. We also find that 
filtering of a time-compressed TV signal creates small ripples following 
each signal burst, but the interburst interference due to these ripples 
can be kept negligible with a small guard time (about 2 percent of the 
burst duration) between different signal bursts. 
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APPENDIX A 
An Alternate Derivation for the Spectrum of the Time-Compressed Signal 


Referring to Fig. 2, the input signal can be written as 


x(t) = y x(t) rectr(t — iT), (29) 


i=—0o 


where rect, (¢) is defined in (3). The output time-compressed signal is 


2180 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


y(t) = BY x al _ ir(1 _ *)|| rect,(t —iT), (30) 


where a is the time-compression factor, and 7 is as defined in (2). Note 
that 


rect,(¢ — iT) <— 7 sine(zfr) exp(—j27fiT), (31) 


e202) 2 ef 


The spectrum of y(t) involves the convolution of the two right sides in 
(31) and (32) and is 





YG) 7a) X(B) exp[—jBiT (a — 1)] 


X sinc[zr(f — Ba)] exp[—j(27f — Ba)iT]dB. (38) 


Using the identity of (9), the above can be rewritten as 
Via | X(B) since[ar(f — paris| 7( F- £) = ‘Jap. (34) 


With a change of variable of 
o=1(f- fy. (35) 


Qa 


Y(f) becomes 


rail ” Tf-3o\ . = Tf-—o _ 
Y(f) -ty] x F ) sine wo F a) [av i)do 
Ey La 
x(f- =) sine| me(f- fa + =) | 
i\. ft 
2x(/-3) sine o(¢-£- 5) : (36) 


which is the same as (12). 
A simple way to check Y(f) is of course to let a = 1 in (36), which 
yields 








Rl Rie 


Y(f) =¥ x(f- =) sinc 7(—i) 
= X(f), (37) 
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as expected. Another way to check Y(f) involves letting a — ©. In 
doing so, we have to assume that the energy in x-.(t) is preserved 
through the time compression, i.e., 


[ sae | x°(t) dt. (38) 


This means that the multiplying factor 1/a in (36) can be dropped, 
and with a—> ©, 


Y(f) = yx(7-3) sine| o( 7-5) (39) 


We now try to verify (39) by a different means. Using a > © as 
described above, the output time waveform is 


2 x(t) dr 


Pas 
2 


. T 
iT + 


y(t) = 8(t - iT) i 


=) 6(¢ -—iT)y(iT). (40) 


U 


Note that if Y(f) © y(t), then 
1 s¥ fot @ ¥ 6(t — iT) y(iT) (41) 
ie tf mare? 

Consider now a general definition of 


T 
x(t) A i Gao) de 


= f x(t) rectr(t — 7) dr. (42) 


The Fourier transform of y(t) is then 
Y(f) = X(f)[T sine(zfT)]. (43) 
Using the above, (40) and (41), we get the desired result of (39). 


APPENDIX B 
Additional Spectral Properties of the Time-Compressed Signal 


With T = 1, we show in (21) that the spectral output of f = af, in 
Y(f) is 


Y(af.) = X(fz — €) sine 7(—e) 
+ ¥ X(f—e+K) sine z(-e+ K), (44) 
Kx0 
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where e depends on both a and f,. Whenever e = 0, we get 
Y(afz) = X(f2), (45) 


which is a desirable result for spectrum-expansion applications. Con- 
sider now a simple example, where a = 2.1 and X(f) consists of two 
spectral lines at f = +5 Hz (a constant sine wave). We expect two 
spectral lines in Y(f) at f = +2.1 X 5 = +10.5 Hz. But using f, = 5 in 
(44), we have 


Y (10.5) = X(5 — 0.5) sine 7(0.5) 
+ Y X(5—0.5+K) sinc 7(-0.5+ K)=0, (46) 


K#0 

which is so because each term containing X(f) in the summation is 
identically zero as X(f) admits nonzero values only at f= +5 Hz. This 
absence of output at f = 10.5 is due to the fact that Y(f) admits 
nonzero outputs only at f= integer, or equivalently in multiple spacings 
of 1/T in the unnormalized case. The segmentation of the input signal 
x(t) into T-second intervals can therefore be viewed as making the 
frequency resolution in Y(/) 1/T. 

As mentioned previously, when e # 0 in (44), the output Y(a/f,) can 
be viewed as some weighted average of X(f; — €) and its neighboring 
points. How close is this average to the value X(f.)? We do not have 
a satisfactory answer, but the following discussion is interesting. Let 
us change the notation in (44) to 


X(fc) = X(fe — ©) sinea(—e) 
+ ¥ X(f. -—€+ K) sinc 7(-e+ K), (47) 


K¥0 

where X (f-) denotes an interpolated or estimated value for X(/). A 
physical interpretation on the above was presented in Fig. 3. An 
alternate view is to rewrite the second term in the above as 

Y X(f. —€ + K) sine r[(f. —€ + K) — fr], (48) 

K¥0 
which is graphically interpreted in Fig. 12. We now do the following 
manipulations: 


X(f) = X(fe —€ + K) sine o[(f. —€ + K) -— fa] 
K 


1/2 


= Y X(fe- e+ K) i exp {j27t[(f: —€ + K) — f.]} dt 


-1/2 
1/2 


Y X(f. — € + K) exp[y2at( fi —e + K)] 


-1/2 * 


xX exp(—j2afxt) dt. (49) 
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Fig. 12—Alternative representation for the summation in the output spectrum of a 
time-compressed signal. 


We note that the term 
Y X(fx — € + K) exp[j27t(f. — € + K)] (50) 
K 


is a Fourier series with a period of unity. Denoting this Fourier series 
by x(t), we see that 


x(t) = exp[j2rt(fi — )] Y X(f — € + K) exp(j27Kt) 
K 
= x x(t — k) exp[j27k( fz — €)], (51) 


which is a complicated summation of various time- and phase-shifted 
versions of x(t). 

We now extend our previous results to a limited case applicable to 
Refs. 5, 6. Consider the special case where a is an integer, denoted by 
N. We take the time-compressed waveform y(¢) in Fig. 2, time shift it 
by iT/N (t = 0 to N — 1) and add all N waveforms together. The 
resultant waveform has no blank time interval and is basically a 
staggering of N time compressed y(t) which we have considered so far. 
This can be written as 


N-1 . 
2(t) = » o(e- , N=2. (52) 


Its Fourier transform is 
N-1 iT 
A(f) = Y(f) ¥& exp ( </20f | 
i=0 


= Y(f) 1 — exp(—j27fT) 


1- exp (7205) 
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(53) 


where Y(f) is the Fourier transform of y(t). The magnitude of the 
second complex factor is 






1 — exp(—j27fT) 1 — cos 2afT 
a py 
1— exp( ~/20f 5) 1- cos( 2a 5] 


The above term vanishes whenever only the numerator vanishes, and 
is nonzero when both numerator and denominator vanish simultane- 
ously. This means that the above term is zero when 


(54) 








k 
f= p (k = integer), (55) 
except at those points where 
N 
f=k 7 (56) 


holds. Therefore, one may infer that the frequency resolution in this 
case is N/T as compared to 1/T in the single y(t) case. 


TIME-COMPRESSION MULTIPLEXING 2185 


Copyright © 1981 American Telephone and Telegraph Company 
THE BELL SYSTEM TECHNICAL JOURNAL 
Vol. 60, No. 9, November 1981 
Printed in U.S.A. 


Transmission Cathodoluminescence 


By A. K. CHIN, H. TEMKIN, and S. MAHAJAN 
(Manuscript received May 11, 1981) 


The application of transmission cathodoluminescence (TCL) in eval- 
uating the quality of luminescing materials is reviewed. This scan- 
ning electron microscope technique is particularly useful in imaging 
localized nonuniformities, such as dislocations and inclusions, in 
semiconductors. Understanding and analyzing such material defects 
are of practical importance since they greatly affect device perform- 
ance and lifetime. After a detailed description of TCL, the advantages 
of this technique in comparison to defect etching, cathodolumines- 
cence imaging, and electron beam-induced current (EBIC) is pre- 
sented. Since TCL is simple to perform, this technique can be used to 
evaluate and monitor material growth and device processing proce- 
dures. Although TcL may be applied to any luminescing material, this 
paper demonstrates its usefulness for the GaAs/GaAlAs and InP/ 
InGaAsP materials systems which provide most sources and detectors 
for optical communication. 


|. INTRODUCTION 


Diode lasers, light-emitting diodes (LEDs), and photodiodes are 
integral components of the current lightwave transmission systems. 
These devices are mainly based on the III-V materials systems, e.g., 
GaAs/GaAlAs and InP/InGaAsP. Both material and device develop- 
ment are recent, relative to the highly developed Si-based technology. 
Material defect analysis is required to evaluate and monitor material 
growth and device processing procedures, especially during the early 
stages of development. 

The performance and reliability of optoelectronic devices are deter- 
mined by the quality of materials and fabrication processes. Threading 
dislocations have been shown to increase the leakage current of both 
InP and GaAs photodiodes;’” these dislocations are also sources of 
microplasmas in avalanche photodiodes.*” The quantum efficiency of 
GaP and GaAlAs:Si LEDs is strongly dependent on the dislocation 
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density.®’ Threading dislocations and inclusions are known sources of 
dark line and dark spot defects in degraded LEDs and solid-state lasers 
fabricated from the GaAs/GaAlAs and InP/InGaAsP materials sys- 
tems.*” Finally, stresses from dielectric coatings are known to induce 
defects and accelerate device degradation.’*”* 

Characterization techniques are required to assess the effects of 
growth and processing procedures on material or device quality. Be- 
cause of the variety of defects, materials, and devices, a number of 
characterization techniques have been developed. These techniques 
are listed and compared in Table I. 

The purpose of this paper is to review the most recently developed 
technique of transmission cathodoluminescence (TCL) and to compare 
it with other defect analysis techniques currently in use.’°’*”* Trans- 
mission cathodoluminescence is performed on a scanning electron 
microscope (SEM); the magnification and depth of field available in an 
SEM are exploited. 


1.1 Defect analysis techniques 


The technique with the highest magnification and resolution, trans- 
mission electron microscopy (TEM), reveals the nature and origin of 


Table I—Comparison of various characterization techniques 
Technique Disadvantages 


IR microscopy Low resolution 
Dependent on detector sensitivity 


Scanning photoluminescence Requires removal of contact layer 
Dependent on detector sensitivity 
Difficult to identify features because of lack of an 
optical image 


Scanning photocurrent Requires p-n junction or Schottky barrier 
Requires electrical contacts 
Difficult to identify features because of lack of an 
optical image 


CL Requires removal of contact layer 
Dependent on detector sensitivity 


EBIC Requires p-n junction or Schottky barrier 
Requires electrical contacts 

TCL Dependent on detector sensitivity 

Defect etching Material, orientation, dopant, and etch dependent 
Destructive 

X-ray topography Low resolution 
Slow 
Restricted to relatively small samples 

TEM Difficult sample preparation 
Destructive 


Small volume of material probed 
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material defects. However, it cannot be used routinely for screening 
defective materials because it is destructive, the sample preparation is 
very difficult, and the volume of material probed is small relative to 
device dimensions. X-ray topography is relatively slow, has limited 
resolution, and is limited to small samples because of internal stresses 
generally associated with device wafers."*” The evaluation of wafers 
by electron beam-induced current (EBIC) scan, cathodoluminescence 
(cL) scan, photoluminescence (PL) scan, photocurrent scan, and in- 
frared microscopy have also been reported.®”** The electron excita- 
tion techniques—EBIC and CL scan—are more useful in comparison to 
the conceptually similar photoexcitation techniques, i.e. photocurrent 
and PL scan, because of the availability of an exactly corresponding 
secondary-electron (SE) image to isolate surface details. Even when 
this disadvantage is overcome by careful experimental design, dislo- 
cations which are responsible for poor device performance and relia- 
bility cannot be readily imaged using these techniques, although some 
specific examples may be found. For example, dark spots in CL images 
have been identified to be dislocations only for the case of GaP, GaAs, 
and CdTe.®***** Additionally, it is difficult to observe dislocations in 
GaAs:Se.”’ Moreover, dislocations have not been imaged in GaAs:Si. 

Etching is the simplest and, thus, the most widely used method for 
revealing defects. Once the identity of etch features has been estab- 
lished, evaluation of material quality by etching is relatively fast. 
Additionally, etching has sufficient resolution to reveal a defect density 
as high as ~10’ cm~”. However, aside from being destructive, etching 
is material, material orientation, and dopant dependent. For example, 
the KOH etch readily reveals dislocations on the {100} and {111} 
surfaces of GaAs.***” However, it cannot be used for the {110} surface 
which is the cleavage plane and, thus, forms the mirror faces of solid- 
state lasers. Huber etch works well for the {100} surface of InP but 
not for the {110} surface.*® The A-B etch reveals dislocations on the 
{111} surface of InGaAsP, but no etch has been developed for the 
technologically important {100} surface.*”*° To estimate dislocation 
densities of InGaAsP layers in a device structure, the InP confining 
layers had to be etched.” Finally, the R-C and A-B etchants are 
successful for the {111}B face of GaP but cannot be used for other 
orientations.***4” 

The interpretation of etch patterns is often difficult even with known 
defect etchants.”' Figure 1 shows the response of an InP crystal to four 
different etchants reported to reveal dislocations: (a) Huber etch, (b) 
6:6:1, (c) RRE, and (d) modified A-B etch.****” The sample is the 
{100} surface of a crystal doped with Sn to 2 x 10’° cm™ and is a 
typical substrate for device wafers. A brief comparison shows large 
differences in these etch patterns. Dislocation and saucer pits are well 
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are in between that of the Huber and modified A-B etches. Therefore, 
the use of an etchant to reveal defects is often unreliable. 

Transmission cathodoluminescence reveals defects as dark spots, 
similar to the cL and EBIC techniques." The characteristic images of 
dislocations have been identified by a one-to-one correspondence with 
a known dislocation etchant.’*” Dislocations are roughly the same size 
and have a comma shape. The comma shape is due to the dislocation 
intersecting the surface of the sample at an oblique angle. Inclusions 
whose contrast is due to nongeneration of CL radiation have random 
shapes and sizes. The improved collection efficiency and measurement 
geometry of TCL in comparison to CL allows defects to be readily 
imaged in many more materials. The materials to which TCL has been 
applied is listed in Table II. 


il. EXPERIMENTAL 
2.1 Transmission cathodoluminescence technique 


A schematic of the TcL technique is shown in Fig. 2. Transmission 
cathodoluminescence is accomplished by placing a light detector un- 
derneath a sample. The CL radiation generated on the top surface 
provides illumination which is transmitted through the sample and 
detected on the opposite side. Both sides of the sample must be 
optically smooth to reduce surface artifacts in TCL images. The TCL 
method has an efficient light collection geometry and no modification 
of the SEM is required. On the other hand, additional optics are 
required in the CL technique to guide the CL to an external detector. In 
Fig. 2, surface defects with their lower cL efficiency are detected as a 
decrease in CL radiation transmitted through the sample. Volume 
defects, i.e. defects which lie beneath the electron-hole (e-h) excitation 


Table Il—Different substrates and device wafers examined by TCL 


GaAs/GaAlAs InP/InGaAsP 
Impurity GaAs:Te InP:S 
striations GaAs:Si InP:Se 
InP:Te 
InP:Sn 
InP:Zn 
Dislocations; GaAs:Te InP:S 
defects GaAs:Si InP:Se 
GaAlaAs:Si LED wafer InP:Te 
GaAs;GaAlAs LED wafer InP:Sn 
GaAs/GaAlAs laser wafer InP:Zn 
InGaAsP 
InP/InGaAsP LED wafer 
InP/InGaAsP laser wafer 


Degradation GaAlAs:Si optoisolator LED InP/InGaAsP transmission LED 
GaAlAs data link LED 
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Fig. 2—Schematic of TCL measurement. Both surface and volume defects are detected 
by the solid-state detector as a decrease in luminescent radiation. The surface defect 
has a lower luminescing efficiency and the volume defect shadows the dector from the 
luminescing surface. 


volume of the electron beam, are detected by their interaction with 
the surface-generated CL radiation passing through the sample to the 
detector below. A void in the sample volume will increase light trans- 
mission because of reduced absorption. A volume defect will decrease 
light transmission because of increased absorption or scattering. Using 
this method, defects deep within a thick sample can be observed. 
Although the transparency of a sample to its own CL radiation is 
material dependent, in the following we give TcL data on samples 
whose thickness is comparable to that of substrates and device wafers. 


2.2 Experimental consideration 


The data given was obtained with an ETEC Corporation autoscan 
SEM, but any SEM with similar capabilities may be used.*° Two solid- 
state detectors were used for the TCL measurements, depending on the 
spectral region of interest. For wavelengths less than ~1.0 pm, a silicon 
photodiode was used; for wavelengths shorter than ~1.6 um, a germa- 
nium photodiode was selected. These detectors have low noise equiv- 
alent power (NEP), fast response at zero bias, and output linearity with 
light input. The performance specifications of the photodiodes are 
listed in Table III. 

For typical SEM parameters of 20 keV and 107’A for the electron 
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Table III—Characteristics of photodiodes used in TCL experiments 


Silicon Photodiode Germanium Photodiode 
Active area 0.20 cm? 0.20 cm? 
EP 8 x 10° W/VHz @ 850 nm 6 x 107'' W/VHz @ 1300 nm 
Responsivity 0.35 A/W @ 850 nm 0.22 A/W @ 1300 nm 


beam, the luminescent power radiated by the sample is approximately 
10°-°W, assuming a CL conversion efficiency of 10~?. This conversion 
efficiency is typical of GaAs but is lower for other materials.*’ Most of 
this radiation is collected by the detector in the TcL configuration 
since it is placed 2-3 mm from the luminescing source. The solid angle 
subtended by the photodiode in the TCL geometry is ~z, whereas the 
collection angle in the usual cL mode is ~10-°z. 

Since the silicon and germanium diodes have an NEP of 10°’? W/ 
VHz and 107!? W/ VHz, respectively, the detectors are capable of sens- 
ing 10°'°W and 10°°W, respectively, of TCL power at a bandwidth of 
10° Hz required for SEM imaging. Approximately two orders of mag- 
nitude greater power is required to easily image material defects 
without the use of difficult noise reduction techniques such as lock-in 
amplification coupled with beam blanking. This power is readily avail- 
able from the materials listed in Table II. 


2.3 Resolution 


The resolution of the TcL technique is limited by the type of defect 
observed. The contrast from dislocations is due to nonradiative recom- 
bination; therefore, it is limited by the minority carrier diffusion length. 
The contrast at inclusions is due mainly to nongeneration of CL, and 
the resolution of these defects is limited by the electron beam condi- 
tions as in SE imaging. Finally, volume defects can be resolved to 
roughly the CL emission wavelength since the defect is effectively 
observed by optical means. 

Figure 3 shows an example of two different defects appearing in the 
same InP:S sample. The large dark spots are due to dislocations 
intersecting the surface, whereas the much smaller dark spots are 
inclusions located within an e-h excitation volume of the surface. The 
identification of defects using TCL will be discussed in Section 2.4. The 
dislocation image is large because of the large minority carrier diffusion 
length. By imaging these dislocations, the minority carrier diffusion 
length is readily estimated in this sample to be ~5 pm. The images of 
inclusions show that defects as small as 1-2 ym can be imaged under 
the beam conditions of 20 keV and 1077A. 

Figure 4 is an example of a volume defect imaged by TCL. Figure 4a 
is the SE image of an unusual feature appearing in an InP/InGaAsP 
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Fig. 3—Transmission cathodoluminescence image of heavily S-doped InP substrate. 
The large and small dark spots are dislocations and inclusions, respectively. 


wafer. A CL picture of the same region, Fig. 4b, shows the area to be 
nonluminescent. It is only the TcL image, Fig. 4c, that explains the 
origin of this dark area. One can clearly see a bright spot in the center 
of the dark region associated with a void in the epitaxial layers. Since 
the void does not appear in the CL image, it lies below the e-h excitation 
volume. 


2.4 Interpretation of TCL images 


In general, defects appear as dark spots in a TCL image. To interpret 
images, dark spots must be identified. For the cL technique, dark spots 
in images of GaAs:Te, GaP, and CdTe were identified to be dislocations 

-by demonstrating an exact one-to-one correspondence between these 
dark spots and dislocations revealed as etch pits produced by a known 
defect etchant.®°”**> We have followed this procedure for the TCL 
technique and applied it to GaAlAs:Si, GaAs:Te, InP:Te, and InP: 


S 14,15 


Two methods were used to show that dark spots observed in a TCL 
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Fig. 4—(a) Secondary electron image of a surface feature on an InP/InGaAsP wafer. 
(b) Cathodoluminescence image of the region shown in (a). The dark circular region 
shows that the surface feature is nonluminescent. (c) Transmission cathodoluminescence 
image of the region shown in (a). The bright dot in the center of the dark area is a void. 


image represent grown-in dislocations. In the first method, a TCL image 
of a sample was taken prior to etching to reveal dislocations. By 
comparison of the etch pattern with the TCL image of the same region, 
an exact one-to-one correspondence was established. In the second 
method, a {100} wafer was etched to reveal dislocations as etch pits. 
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Subsequently, the samples were cleaved along the {110} cleavage 
planes. By examining with TcL the {110} surface where it intersects an 
etch pit, a dark region immediately below each pit was found. Thus, 
each dislocation intersecting the {100} surface corresponds to a dark 
spot in a TCL image. The technique of first etching the surface and 
then showing dark spots at dislocation pits is not useful since surface 
features always appear in either a CL or TCL image. 

In Fig. 5a, a TCL image of a Si-doped Ga_,Al,As layer grown by 
liquid phase epitaxy (LPE) on a {100} oriented GaAs substrate is 
shown.” The aluminum concentration varies from x = 0.00 on the top 
surface to x = 0.30 at the substrate-epitaxial layer interface. This 
gradient occurs over a thickness of ~250 ym. The sample has been 
bromine-methanol polished to produce smooth, damage-free surfaces. 
Only dust particles are observed on the top GaAlAs surface when 
examined by SE imaging and no features could be seen in normal cL 
using an S1 photomultiplier. This observation is in agreement with 
previous measurements on similar material.’ However, when examined 
by TCL, a number of randomly distributed dark spots appear. The size 
of these dark spots varies from 5 to 10 um. At high magnification, their 
shape closely resembles commas; that is, the central dark region is 
connected with a fainter tail. The average density of these dark spots 
is approximately 2 x 10* cm”. 

Figure 5b shows the SE image of the region shown in Fig. 5a. The 
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Fig. 5—Transmission cathodoluminescence image of a GaAlAs:Si epitaxial layer 
grown on {100} GaAs substrate. The dislocations appear as dark spots approximately 10 
pm in diameter. (b) Secondary electron image of the area shown in (a) after 30 min in 
300°C KOH etchant. The circles surround the small (1- to 2-~m diameter) dislocation 
pits. The squares surround the large (5 to 10 pm) dislocation pits. 
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sample has been etched in molten KOH at 300°C for 30 min to reveal 
dislocations. The dislocation pits in {100} GaAlAs, as revealed by this 
etchant, are hexagonal in shape and terminate in a central point.” 
Two distinct dislocation pit sizes are observed independent of etching 
time. The pit size has been related to the dislocation inclination.” A 
careful comparison of Figs. 5a and 5b shows a one-to-one correspond- 
ence between the large (5- to 10-um diameter) dislocation pits sur- 
rounded by a square in the figure and the Tc dark spots. In addition, 
apparently similar TCL spots also correspond to the small (1- to 2-um 
diameter) dislocation pits surrounded by a circle. 

Figure 6a is a TCL image of a Te-doped InP crystal. The vertical 
unevenly spaced lines are the impurity-induced growth striations. 
Their contrast is due to the dependence of the cL efficiency on doping 
density and not due to unresolved defects.” The dark band on the left 
of the figure is a scratch intentionally introduced to specify the 
location. Neither the striations nor the dark spots indicated by letters 
A through D are apparent on either a CL or an SE image. At higher 
magnifications, the dark spots have comma shapes as was found for 
the GaAlAs:Si sample in Fig. 5. The feature indicated by an arrow is 
a dust particle which appears also in the SE image. 

Figure 6b is a Nomarski optical photograph of the region shown in 
Fig. 6a after Huber etching to reveal dislocations. The magnification 
of Figs. 6a and 6b are slightly different. The four dislocation pits are 
labeled A through D for their corresponding TcL dark spot; an exact 
correspondence between dark spots and dislocation pits is found by 
comparing Figs. 6a and 6b. As compared to the freshly etched surface 
of Fig. 1a, Huber etching followed by TcL scanning produces a fairly 
poor surface. In addition, the growth striations are not clearly revealed. 
It is believed that carbon deposition during the TCL examination 
interferes with the etching process. The carbon is due to electron- 
beam decomposition of residual organics within the sample chamber. 
Etch pit D is somewhat obscure because the carbon film deposited 
over that area was twice as thick, a result of overlapping scan regions. 

Figure 7a shows an SE image of the cleaved {110} surface of an InP: 
S sample after Huber etching and cleaving. Several dislocation pits 
produced on the {100} surface are intersected. A TcL scan of the 
corresponding area is shown in Fig. 7b. A dark region extending into 
the sample beneath each pit is observed, indicating that the pit 
corresponds to a dark spot on a TCL image of the {100} surface. Since 
the dark regions do not extend further into the crystal, the observed 
images could be due to portions of dislocation loops. 

The above examples of correspondence between etch features and 
TCL images brings out several advantages of TCL over etching. First, 
TCL can be applied to any crystal face, whereas defect etchants are 
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Fig. 6—(a) Transmission cathodoluminescence image of {100} surface of InP:Te. (b) 
Secondary electron image of the area shown in (a) after Huber etching to reveal 
dislocations. Corresponding features in (a) and (b) are labeled A through D. 
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Fig. 7—(a) Secondary electron image of {110} surface of InP:S. Several etch pits on 
the {100} surface produced by Huber etching are intersected on the right-hand edge of 
the sample. (b) Transmission cathodoluminescence image of area shown in (a) displaying 
a dark region below each etch pit. 
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extremely orientation sensitive. Neither the KOH nor the Huber etch 
reveals defects on the {110} surfaces used as light-emission facets for 
lasers. Secondly, a defect etchant must be developed for each semicon- 
ductor, whereas TCL requires, at most, a change in detectors to accom- 
modate the spectral range. Additionally, the development of a defect 
etchant is difficult and time consuming. Finally, etching is extremely 
sensitive to the surface preparation. The etchant may not work 
properly if the surface has been contaminated. 


Ill. SUBSTRATES/EPILAYERS 


The common substrates used for optoelectronic devices are GaP, 
GaAs, InP, and GaSb. Since good photodetectors at 1.6 um are una- 
vailable, dislocations in GaSb have not been imaged by any lumines- 
cence technique. In the case of GaP, dislocations appear readily under 
examination by CL or TCL. Using CL, however, dislocations in GaAs are 
readily imaged only for the case of Te doping; dislocations in GaAs:Se 
are difficult to image and have not been imaged in GaAs:Si.”’ Finally, 
dislocations in InP have not been observed for any dopant with cL 
scanning. However, using TCL, dislocations are easily observed for the 
commonly available GaAs and InP substrates.'°'*’” These materials 
are listed in Table II. We will demonstrate the usefulness of the TCL 
technique on selected GaAs and InP substrates. 


3.1 GaAs substrates 


Figures 8a and 8b are, respectively, the SE and TCL images of a 375- 
pm-thick GaAs:Si substrate commonly used for the growth of LED and 
laser wafers. The white spot in the upper right of Fig. 8a is a dirt 
particle that appears dark in the TcL image; the dirt particle effectively 
blocks the electron beam from the semiconductor and prevents CL 
generation. The remaining comma-shaped dark spots with no corre- 
sponding features in the SE image are due to dislocations intersecting 
the surface at an oblique angle; the direction of the tail indicates the 
sense of inclination. This TCL image is typical of GaAs substrates 
grown by the gradient freeze technique; growth striations show weak 
contrast and are rarely observed. The dislocation image is unchanged 
when the dopant is changed from silicon to tellurium, another standard 
dopant for GaAs substrates. 


3.2 InP substrates / epilayers 


The recent work of Seki et al. has demonstrated that the macro- 
scopic perfection of InP crystals grown by the liquid encapsulated 
Czochralski (LEC) technique can be improved by heavy doping.*””’ The 
order of effectiveness in reducing the dislocation density is Zn, S, and 
Te. It is of considerable technological interest to assess whether or not 
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Fig. 8—(a) Secondary electron image of the {100} surface of a GaAs:Si substrate. The 
white spot on the upper right is a dirt particle. (b) Transmission cathodoluminescence 
image of the area shown in (a) displaying four dislocations. The diameter of the 
dislocation image is approximately twice the minority carrier diffusion length. 


the macroscopic perfection of such highly doped substrates can be 
replicated into isoepitaxial layers grown by LPE. This is especially 
important since it was shown by Mahajan et al. that although heavy 
zinc doping reduces the dislocation density, a high density of Zn related 
precipitates results.”” Similar effects have also been found for heavy 
Ge doping of InP.” Although individual precipitates, which are ~675 
A in dimension, cannot be resolved using TCL, precipitate clusters can 
be imaged.” Transmission cathodoluminescence imaging in this ex- 
ample is important since neither the precipitates nor their clusters 
show enough strain contrast to be observed either in X-ray topography 
or in Huber etching.” TEM is required for a detailed study, but TCL is 
useful for a simple, macroscopic investigation. 

To facilitate the location of corresponding areas of substrate and 
epilayers in the TcL study,” a pattern was produced on one side of the 
wafers by evaporating gold through a shadow mask. This pattern may 
be observed irrespective of which side is excited with the electron 
beam. A similar CL or EBIC study would not be easy since corresponding 
areas of a wafer on opposite sides are difficult to locate in these 
techniques. In the TCL study, each side of the wafers is examined with 
the electron beam since maximum contrast and resolution of defects 
are obtained when the defects lie within the e-h excitation volume. 

As ascertained by etch pitting, the quality of epilayers grown on the 
Zn-doped substrates was satisfactory and compared well with that of 
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the substrate.” Occasionally, etch pits in the form of clusters were 
observed. The TCL study, however, revealed additional features within 
the epilayers. These results are presented in Fig. 9. Figure 9a is the sE 
image of the 10-ym-thick InP:Sn epitaxial layer. This epilayer is a 
typical buffer layer used in device wafers. The small white features are 
indium-rich beads caused by localized surface decomposition. The 
beads appear as dark spots in the TCL image of Fig. 9b. In addition, 
however, a high density of light grey circular areas, whose origin at 
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Fig. 9—(a) Secondary electron image of a 10-um-thick InP:Sn epitaxial layer. (b) 
Transmission cathodoluminescence image of region shown in (a). (c) Transmission 
cathodoluminescence image of the underlying InP:Zn substrate. 
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Fig. 10—(a) Secondary electron image of a 10-m-thick InP:Sn epitaxial layer. (b) 
Transmission cathodoluminescence image of region shown in (a). (c) Transmission 
cathodoluminescence image of the underlying InP:S substrate. 


present is uncertain, is apparent. These defects are likely related to 
the precipitates within the substrate, although corresponding features 
are not present in the substrate of Fig. 9c. Only growth striations and 
two defects are observed in the substrate. The dark border in Figs. 9b 
and 9c is due to the gold mask evaporated on the substrate side. A 
mirror inversion about a horizontal axis is required to exactly compare 
the image of the substrate with that of the epilayer. 

Figure 10 shows images of a 10-ym-thick InP:Sn layer grown on a 
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heavily S-doped substrate. Figure 10a shows the sE image of the 
epilayer surface. The indium-rich beads on the surface are again 
evident. Figure 10b demonstrates that this portion of the epilayer is 
dislocation free and correlates very well with the quality of the sub- 
strate as illustrated in Fig. 10c. Nearly horizontal light and dark bands 
are growth striations, and darker regions on some of these bands may 
represent local dopant variations. 

As a result of these studies, it was confirmed that macroscopically 
dislocation free InP may be obtained by heavy Zn or S doping. 
However, defect-free isoepitaxial layers may be grown only on the S- 
doped substrates. The defects in the epilayers on InP:Zn are probably 
a result of precipitates within the substrate. These precipitates have 
been observed by TEM in low-dislocation InP:Zn but not InP:S. There- 
fore, it is recommended that InP:S crystals having low-dislocation 
density be used as substrates for the growth of device wafers. 


3.3 InGaAsP epitaxial layers 


For InGaAsP layers whose bandgap corresponds to 1.3 um, the Si 
photodetector normally used to detect transmitted CL was replaced 
with the Ge photodiode. Although cL radiation transmitted through 
the sample was detected by the Si diode in all cases, defects appeared 
for most of the InGaAsP samples studied only in the TcL image 
obtained with the Ge detector. Apparently, for these samples, the 
defects affected the CL emission and optical properties in the spectral 
region beyond the Si bandgap, i.e. wavelengths longer than 0.9 ym. 

Figure 11 shows an SE and TCL image ofa 2-um-thick n-type, 
nominally undoped, InGaAsP epilayer grown on a 15-mil-thick, {111} 
oriented InP substrate. This TcL image was obtained using the Ge 
photodiode. The InP substrate does not contribute to the TCL image 
since it is transparent to the InGaAsP cL centered at 1.3 um. Dark 
lines correspond to surface scratches or cracks seen in an SE image. 
The lower region of Fig. 11b is darker than the upper region because 
of a slight nonuniformity in the cL or optical properties of the quar- 
ternary layer. The dark spots, approximately 2 um in diameter and 
similar in shape, have no corresponding surface features. By analogy 
with the TcL study of dislocations in GaAs and InP, it is suggested 
that these spots are caused by dislocations in the epitaxial layer. 
Additionally, it has been shown by etching studies that dislocations in 
the substrate are sources of dislocations in quarternary layers. The 
average density of dark spots imaged by TcL, 2-3 xX 10* cm’, is 
approximately the same as that of the InP substrate. Finally, the 
minority carrier diffusion length of InGaAsP has been measured to be 
1 to 2 pm in agreement with the diameter of the dark spots.”? 

It should be noted that the Ge detector is unsuitable for TCL imaging 


2204 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 





50 um 50 pum 
(a) (b) 


Fig. 11—(a) Secondary electron image of a 2-um-thick n-type InGaAsP layer grown 
on a 15-mil-thick {111} oriented InP substrate. (b) Transmission cathodoluminescence 
image of region shown in (a). The dark spots not apparent in the SE image are 
dislocations. 


of p-type InGaAsP, which has at least an order of magnitude lower 
efficiency than an n-type material. Imaging of p-type InGaAsP may be 
possible using lock-in amplification coupled with beam blanking of the 
electron beam. 


IV. DEVICE WAFERS 


The TCL imaging of device wafers differs from that of substrates 
because of the presence of a p-n junction and heterojunctions within 
the wafer. The built-in electric field at the p-n junction (depletion 
region) separates the generated electrons and holes and prevents 
recombination.” Thus, in the cL, EBIC, and TCL modes, the depletion 
region is probed indirectly. In either the CL or TCL mode, the image is 
complicated by this depletion width if the p-n junction intersects the 
e-h excitation volume. For the plane view configuration where the 
electron beam is orthogonal to the plane of the p-n junction, CL 
radiation is generated in the layers confining the p-n junction, i.e. the 
layers above and below the p-n junction. Thus, the image is composed 
of three components: the cL image of the two confining layers and 
variations in the depletion width. Electron beam-induced current 
scanning, however, is primarily composed of variations in the depletion 
width and comparison of EBIC images with CL and TCL images may be 
used to locate the position of defects within the wafer. 

In this section, we will demonstrate the usefulness of TCL in evalu- 
ating the quality of device wafers used in fabricating LEDs and lasers 
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based on both the GaAs/GaAlAs and InP/InGaAsP material systems. 
Presently, TCL is used to screen out GaAs/GaAlAs wafers that produce 
unreliable LEDs; highly reliable LEDs are incorporated as light sources 
in the Western Electric (WE) 1250A optical transmitter. Additionally, 
TCL is used to screen out GaAs/GaAlAs laser wafers with a high 
density of rake lines without the removal of the contacting layer. 
Finally, TCL is used to screen out InP/InGaAsP LED and laser wafers 
with misfit dislocations. 


4.1 GaAlAs LEDs 


High radiance GaAs/GaAIAs LEDs and lasers operating at a current 
density of J = 10° A/cm’ can degrade rapidly by the development of 
dark line defects (DLDs).**"° Since these optoelectronic devices are 
intended for optical communication systems, a high yield of reliable 
devices is necessary. By detecting defects which act as sources for 
DLDs during processing, wafers with an unacceptable defect density 
may be rejected early and, thus, save processing time and cost. 

A preliminary study involving single heterostructure LEDs shows 
that approximately 12 percent of the devices fail a stress test (burn-in) 
because of the growth of DLDs, whereas the remaining LEDs have a 
lifetime in excess of 10° hours.”® Since DLDs in GaAs/GaAlAs-based 
devices are observed to originate at material defects, a high yield of 
long-lived devices may not be consistently achieved due to inevitable 
variations in substrate quality, epitaxial growth conditions, and proc- 
essing procedures. A nondestructive technique such as TCL selects 
device quality material, evaluates growth and processing procedures, 
and reveals the degradation mechanism of fabricated devices. 

The schematic of the GaAlAs LED used in the WE 1250A transmitter 
and the epitaxial structure is shown in Fig. 12. The growth of the 
GaAlAs layers by LPE and the device processing have been described 
elsewhere.”’ The 50-um-thick n-type GaAlAs carrier confinement layer 
also serves as the window and mechanical support layer. This planar 
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Fig. 12—Schematic of planar DH LED and device wafer. The shaded region of the 
device is Ge-doped. 
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structure facilitates device processing but the epitaxial growth is 
difficult. Because of the thickness of the first-to-grow window layer, 
the aluminum concentration gradient is large and nonequilibrium 
growth conditions must be controlled in forming the active and final 
confinement layers. 

The device wafers are examined with the epilayers in the as-grown 
- conditions. The substrate side of the wafers, normally rough polished, 
is bromine-methanol polished after epigrowth to reduce scattering of 
the TcL radiation. The electron beam of the SEM is scanned over the 
episide of the wafers since the quality of the epimaterial is of main 
interest. The typical 375-u~m-thick GaAs substrate does not have 
sufficient absorption to reduce the TcL intensity below the limit 
required for good imaging using the silicon photodiode. This is in 
contrast to the CL or PL scan geometry where a thin (1 to 2 ym) GaAs 
contact layer is sufficient to reduce the signal below the detection 
limit.*° No processing problems because of TCL examination of device 
wafers have been found; the wafers must be soaked in concentrated 
sulfuric acid to remove the electron beam deposited carbon prior to 
standard processing. 

Figure 13 is a TCL image of a wafer that produced a high yield of 
reliable devices. Almost no dark spots corresponding to defects are 
observed; the faint light and dark bands are the growth lamellae 
normally observed in wafers grown by LPE. Figure 14a is a TCL image 
of an LED wafer grown under nonequilibrium conditions. Defects in 
the substrate are not imaged since the cL efficiency of GaAlAs is much 
higher than that of GaAs. The epilayers were grown on low-dislocation 
(<10? cm™”) Si-doped substrates. However, as shown in Fig. 14b, an 
order of magnitude higher defect density (=10* cm7’) is found in the 
TCL image of the epilayers. These defects cannot be observed using 
Nomarski contrast microscopy and are, thus, “hidden.” An etching 
study using KOH was attempted to find the nature and origin of the 
defects. The etchant study was performed carefully so as not to remove 
the 2-um-thick, last-to-grow layer. The etch pattern obtained did not 
distinctly reveal dislocation pits but resembled the TcL image. A 
similar etch study of a grooved sample showed these defects to be 
localized to the active and last-to-grow layers. 

A TEM evaluation of the sample shown in Fig. 14a ascertained that 
the defects are dislocation tangles located within the epilayer. Figure 
14b displays one of the dislocation tangles. The bar indicates 1 pm. A 
comparison of Figs. 14a and 14b shows that the TcL defect image is 
roughly 10 to 20 times their actual size, a result of the large minority 
carrier diffusion length. It is apparent from Fig. 14b that the disloca- 
tions are very closely spaced contiguous to the center, whereas inter- 
dislocation spacing increases when going toward the edges. Further- 
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Fig. 13—Transmission cathodoluminescence image of a planar DH LED wafer-produc- 
ing reliable LEDs. 


more, stereomicroscopy reveals that the dense dislocation networks 
are essentially planar in character and are located within the active 
and last-to-grow layers. Since the networks appear to intersect along 
the [110] direction, their habit planes may be (111) and (111) planes. 
The topology observed by stereomicroscopy is consistent with this 
suggestion. 

A plausible explanation for the origin of dislocation networks is the 
following. It is suggested that at the temperature at which the growth 
of the thick window layer was completed, the active layer melt was 
supercooled in excess of 5°C. Faceted growth on {111} planes is 
initiated locally by this supercooling. Since the compositions of layers 
growing on the (001), (111), and (111) planes are likely to be different, 
accommodation dislocation networks will be generated at the conflu- 
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Fig. 14—(a) Transmission cathodoluminescence image of a planar DH LED wafer 
grown under nonequilibrium conditions. The dark areas are dislocation tangles. (b) 
Transmission electron microscopy image of an individual dislocation cluster in the 
samples shown in (a). 
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ence of these different growth fronts. This growth problem encoun- 
tered in the early stages of the planar double heterostructure (DH) LED 
development, results in a low yield of reliable devices. After adjusting 
the composition of the melts so that growth of the active and last-to- 
grow layers is under near-equilibrium conditions, dislocation tangles 
in the device wafers were no longer observed. 


4.2 GaAs/ GaAlAs lasers 


The use of TCL as a nondestructive screening technique for defects 
in GaAlAs DH laser material has been recently reported by Gaw and 
Reynolds."® Every wafer is examined with TcL prior to processing. 
Wafers with a high density of rake lines are rejected since devices with 
this defect either do not lase or have an excessive threshold current. A 
nondestructive screening technique for this growth defect is especially 
important since ~45 percent of all material grown exhibits rake lines.”® 

The LPE laser material was prepared for the 0.83-um lasers used in 
the FT-3 transmitter subsystem.’® It has the usual four-layer DH with 
a Te-doped n-ternary confinement layer. Starting from the n-GaAs 
substrate, the layers have the compositions (1) N-GaAlAs (x = 0.34 to 
0.44), (tz) P-GaAlAs (x = 0.00 to 0.10), (wiz) P-GaAlAs (x = 0.34 to 
0.44), and p-GaAs. The thickness of the p-active layer is 0.10 to 0.20 
pm and the total thickness of the p-layers is ~1.8 pm. The p-GaAs is 
used for a contact layer. This contact layer prevents wafer scanning by 
either CL or PL because of absorption within this layer. Photocurrent 
scan may be used if contacts are applied, but TcL has a higher 
resolution in addition to the advantages provided by the SEM. 

The TCL imaging of laser wafers is performed as previously for the 
LED wafers. In these wafers the back surface is polished prior to growth 
and, thus, TCL involves no additional processing. 

Figure 15 compares the TCL image of a slice having severe rake lines 
with the almost featureless SE image of the top surface. Wafers with 
such a TCL image are rejected from further processing. Figure 16a and 
16b shows the SE and TCL image of a wafer free of rake lines. In the 
absence of gross defects, the dislocations and dark spot defects which 
have lower contrast may be observed. By correlating the TcL evalua- 
tion with broad area and stripe geometry laser yields, TCL was shown 
to be an effective nondestructive screening technique for rake lines. 


4.3 InP / InGaAsP LED and lasers 


In InP/InGaAsP heteroepitaxy, the possibility of lattice mismatch 
exists. Misfit dislocations at the substrate-epilayer interface form when 
lattice mismatch is in excess of 0.06 percent.”® These misfit dislocations 
have a deleterious effect on device performance and reliability.“ In 
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Fig. 15—Comparison of the TcL image of DH laser material having severe rake lines 
with the almost featureless SE image of the top surface of the slice. 
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Fig. 16—(a) Secondary electron image of a DH laser wafer. (b) Transmission catho- 
doluminescence image of dislocations and dark spot defect in area shown in (a). 


this section, we show that TcL can be used to screen out device wafers 
with misfit dislocations present. 

Figure 17 presents the results obtained on a DH InP/InGaAsP/InP 
wafer intended for 1.3-4m devices. The wafer was lattice mismatched 
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with Aa/a = 0.12 percent. The active layer is ~0.5 wm thick and may 
be used for either LEDs or lasers. In Fig. 17a, the SE image of the top 
epitaxial layer shows growth lamellae, hillocks, and surface scratches. 
The epilayers were grown on a {111} oriented substrate. Most of the 
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Fig. 17—(a) Secondary electron image of a DH InP/InGaAsP wafer. Growth lamellae, 
hillocks, and surface scratches are observed. (b) Cathodoluminescence image of the 
region shown in (a). The dark areas are surface defects. (c) Transmission cathodolumi- 
nescence image of the region shown in (a). The vertical and diagonal lines not seen in 
the cL image are misfit dislocations. 
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features of Fig. 17a are reproduced in Fig. 17b which shows a CL picture 
of the same region. The CL image was taken with an S1 photomultiplier 
which has a spectral response similar to that of an Si photodiode. Dark 
nonluminescing regions are clearly visible. Finally, in Fig. 17c, we show 
the TCL picture obtained with an Si detector. Additional features not 
observed in either SE or CL modes of the SEM are the large number of 
sharp diagonal and vertical dark lines which form an angle of ~60° 
between them. These lines are not within the electron beam range on 
the surface since they do not appear in the CL image. It is suggested 
that they are misfit dislocation at the InP buffer layer-InGaAsP active 
layer interface. Similar TCL imaging of lattice mismatched epilayers 
grown on {100} oriented substrates reveals two sets of orthogonal dark 
lines, providing additional support for this interpretation. Additionally, 
X-ray topographs of lattice mismatched {111} oriented wafers show 
images of misfit dislocations similar to those shown in Fig. 17c.*°” 


V. DEGRADATION 


Rapid degradation of optoelectronic devices has been due to defect 
growth. The overall device and especially the light-emitting region are 
often of extremely small dimension. Etching to reveal defects followed 
by visual examination is extremely difficult because of problems in 
sample handling and resolution. It is also difficult to perform defect 
analysis using the techniques of PL scan, infrared microscopy, and 
photocurrent scan for similar reasons. Although TEM analysis may be 
the only technique that can resolve defects, preparation of samples for 
TEM becomes an art. On the other hand, SEM techniques are both 
convenient and have adequate resolution and magnification in most 
cases. However, EBIC and CL analysis may be prohibited by the 
geometry of the device. The region of interest is often inaccessible to 
the electron beam and the sample must be demounted from the 
headers. 

Figure 18 shows three device structures where EBIC and CL evalua- 
tion is difficult. Figure 18a shows either a planar LED or laser structure; 
the p-contact represents a dot for the LED and a stripe for the laser. 
Figure 18b shows an LED used for optoisolators. In the planar LED, the 
window layer is ~50-100 ym thick, thus, preventing the electron-beam 
excited carriers from reaching the active layer. In the laser structure, 
the device is usually mounted episide down for better heat-sinking. 
Even when the top n-metallization is partially removed to access the 
semiconductor, the thick substrate prevents the carriers from reaching 
the p-n junction. For the LED shown in Fig. 18b, the p-n junction is 
again located far from the excitation source. Thus, without demounting 
the devices, EBIC and CL provide little information about the p-n 
junction or active layer where the light is generated in electrolumi- 
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Fig. 18—(a) Schematic of InP/InGaAsP LEp or laser structure. (b) Schematic of a 
GaAlAs:Si LED structure. 


nescence. Once a device is demounted for analysis, it is impractical to 
rebond it for EBIC analysis. For the demounted devices, TCL is advan- 
tageous over CL because of the greater sensitivity of the technique in 
imaging defects. Examples will be shown where TCL is used to evaluate 
device degradation. 

Formation of dark spots and dark lines oriented in the (100) and 
(110) directions is a degradation mode for LEDs and lasers fabricated 
from many semiconductors, eg., GaP, GaAs/GaAlAs,?*)°*? 
GaAsP,° and InP/InGaAsP.'**°™ Dark line defects oriented along 
the (110) direction grow by glide of dislocations on the {111} slip 
planes,” ® whereas the growth of (100) DLDs may involve both glide 
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and climb of dislocations.**” Studies have shown that formation of 
(100) DLDs require minority carrier injection, whereas (110) DLDs can 
be induced with either minority carrier injection or stress. 

Studies of stress-induced DLDs in GaAs/GaAIAs DH crystals provide 
a useful background. These studies showed that a threshold stress is 
required to form (110) pLps.®~® This threshold level decreases for 
increasing minority carrier injection.©® For no injection, the thresh- 
old stress is ~8 X 10° dynes/cm”. This value rapidly decreases to 
~2 X 10° dynes/cm’ under the lower excitation level of 3W/cm? from 
a 6471 A Kr-ion laser beam, equivalent to a current density of J = 1 
A/cm’. A lower stress level of ~6 X 10° dynes/cm’ is found for J = 
58 A/cm’.™ These stress values are comparable in magnitude to other 
typical sources of stress. Stresses from dielectric coatings can be as 
high as 10°° dynes/cm’.” Stresses because of dicing damage are 2- 
7 X 10’ dynes/cm?,” and the internal stress on a mounted LED, assum- 
ing a 100°C temperature difference between header and LED, can be 
~10° dynes/cm?.” Care in processing and handling is required if stress- 
induced DLD formation is to be avoided. 


5.1 InP / InGaAsP LED 


The degradation of InP/InGaAsP LEDs by the formation of (110) 
DLDs has recently been reported by Temkin et al.’® It was demonstrated 
that this mode of degradation occurs for temperatures in excess of 
190°C even without current bias. Although DLDs may be observed in 
electroluminescence and EBIC scanning, TCL was used to show that the 
DLDs originated at defects within the InP buffer layer. Additionally, 
the DLDs were confined to the InP buffer layer and did not extend into 
the quarternary active layer during aging. 

The wafer consists of an Sn-doped InP buffer layer (n ~ 2 x 10° 
cm™*, 2 to 4 pm thick) followed by a InGaAsP active layer (not 
intentionally doped, 0.7 um thick) and an Zn-doped InP confining layer 
(p ~ 3 X 10 cm™®, 2 um thick). The p-n junction, formed by Zn 
outdiffusion from the confining layer, is placed at the interface between 
the buffer and quarternary layer. The lattice mismatch, Aa/a, between 
the InGaAsP active layer and InP was kept below 2 x 107*, and the 
wafer, thus, has no misfit dislocations to begin with. After thinning the 
wafer to ~100 pm, metallizations were electron-beam evaporated using 
shadow masks. The Be/Au p-contact, 50 wm in diameter, was made 
directly to the p-confining layer; the Au-Sn-Au n-contact was deposited 
during the same evaporation run.” Finally, a 20-m-thick gold heat- 
sink, separated from the semiconductor by a 2000-A-thick layer of 
SiO, was plated on the p-side of the wafer to assure low thermal 
impedance of the diode. The finished devices were mounted with 
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epoxy on heat-sink headers and wire bonded. A schematic of the device 
structure is shown in Fig. 18a. 

The electroluminescence (EL) images of an LED before and after the 
200°C aging (300 h and no bias) are shown in Fig. 19. Figure 19a shows 
a uniform and well-defined light spot imaged by an infrared-sensitive 
vidicon tube. No inclusions or processing-induced defects could be 
seen. Figure 19b shows the EL pattern of the LED degraded to 40 
percent of its initial light output. A large number of (110) DLDs could 
be seen throughout the entire contact area. Similar EL patterns have 
been seen in all of the 30 devices aged by 200°C storage (no bias) 
degradation. 

To find the position of DLD sources within the device structure, 
diodes which were carefully removed from their heat-sink headers 
were examined by TcL. After demounting the devices, the p-contact 
and the confining p-InP layer were removed using appropriate etch- 
ants, and devices were evaluated by TcL. Two series of TCL scans were 
performed. In the first, the exposed opening in the n-metallization was 
scanned by the electron beam of the SEM and the signal detected by a 
Ge-PIN placed under the device. The resulting image, in which CL was 
generated in the InP substrate is shown in Fig. 20a. Inclusion-like 
defects with associated DLDs can be seen. This image is similar to an 
EBIC image of the device. In the second TCL scan, the chip was turned 
upside down with the quaternary (Q) active layer facing the electron 
beam and the detector placed directly below the n-InP. In this config- 
uration, in which the CL signal originated in the Q-layer, inclusions 
and DLDs were not observed. After removal of the active layer with a 
selective stop-etch, the exposed surface of the n-InP buffer layer was 











(a) (b) 


Fig. 19—-(a) The EL pattern of an LED before aging. (b) The EL pattern after storage 
aging at 200°C for 300 h with no bias. 
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(b) 


Fig. 20—(a) Transmission cathodoluminescence image obtained on a degraded device. 
The n-InP surface is imaged in the opening in the metallization. (b) Transmission 
cathodoluminescence image of the n-InP buffer layer with the quarternary active layer 
removed by a selective etchant. 


imaged and the result is shown in Fig. 20b. Again, only the area above 
the opening in the n-metallization was imaged. The prominent DLDs 
observed previously are still visible (a mirror inversion is needed for 
comparison between Fig. 20a and 20b). However, a number of addi- 
tional features can be seen. These consist of a much greater density of 
inclusion-like defects and a large density of shorter DLDs originating at 
those defects. This DLD pattern is similar to the images obtained by 
EL. Thus, the DLDs and their sources appear to be confined to the InP 
buffer layer and its interfaces with the InP substrate and the Q-active 
layer. Since these DLDs grow without current injection, they are 
believed to be stress induced. 


5.2 Planar GaAlAs LED 


For planar GaAlAs LEDs, the TcL technique is especially important 
since dislocations and DLDs cannot be observed in either the EBIC or 
cL mode of the sEm. Although good quality cL images of the LEDs were 
obtained on our apparatus, dislocations are still not observed probably 
because of a lack of instrument sensitivity. Balk et al. showed that a 
highly efficient CL apparatus is required to detect dislocations in GaAs: 
Se;” a similar apparatus may be required to detect dislocations in our 
device wafers. Also, because of the thickness of the window layer, the 
EBIC Signal is not from primary e-h pairs generated by the electron 
beam, but from secondary e-h pairs generated within a minority carrier 
diffusion length of the p-n junction by recombination radiation from 
the primary electron-holes. Thus, the effect of dislocations on the EBIC 
signal is likely to be below the noise level. 
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Figure 21 is a TCL image of the back surface of an LED that degraded 
during the 100 h burn-in. The 50-um-diameter area defines the contact 
or light-emitting region. A (100) DLD is observed in the contact region 
along with two (110) pLDs. These DLDs were observed in the EL image. 
However, the source of the two (110) DLDs, not seen in EL, is a 
dislocation lying at the intersection of the DLDs. This dislocation, 
appearing as the dark spot indicated by the arrow in Fig. 21, intersects 
the back surface outside the contact area but probably threads through 
the light-emission region below the surface. The source of the (100) 
DLD could not be distinguished in this image; this DLD was already 
well developed as shown by the larger width in comparison to the (110) 
DLDs. The high density of dark spots representing the intersection of 





Fig. 21—Dark line defects oriented along (100) and (110) directions in a planar DH 
LED induced by current injection during burn-in. The circular region defines the 50-m- 
diameter back surface contact. 
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(a) (b) 


Fig. 22—-(a) Secondary electron image of the p-surface of a planar DH LED. The circle 
encloses the contact. (b) Transmission cathodoluminesence image showing (110) DLDs 
induced by current injection during long-term aging. The source of DLDs are the two 
scratches indicated by the arrows. 


dislocations with the surface in the vicinity of the contact and the 
identification of a dislocation as the source of the (110) DLDs in this 
sample suggest that a dislocation may also be the source of the (100) 
DLD. 

The majority of the DLDs observed in samples degraded during burn- 
in are similar to those shown in Fig. 21. Also typical of degraded LEDs 
is the high density of dislocations. It is emphasized that no dislocations 
were observed in the contact area of undegraded LEDs subjected to the 
burn-in. 

Figure 22 shows the p-surface of a device that catastrophically 
degraded after ~5000 h of operation at room temperature under 6 kA/ 
cm’ bias. From the TcL image, Fig. 22b, the method of degradation can 
be reconstructed. Two scratches were initially present, i.e. the two 
lines away from the p-contact indicated by the arrows. During opera- 
tion, a (110) DLD started from one of the scratches and propagated 
towards the contact. Upon reaching the contact area, enclosed by a 
circle in the SE image of Fig. 22a, the DLDs multiplied rapidly because 
of the high-current density. The catastrophic degradation at ~5 x 
10° h agrees with the slow growth of (110) DLDs under the low injection 
conditions away from the p-contact. Thus, TCL examination identifies 
a previously unreported method by which LEDs degrade; defects away 
from the contact area may initiate DLDs that grow into the contact 
area and rapidly degrade the device. 
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GaAlAs:Si LED 


Graded-bandgap, homojunction GaAlAs:Si LEDs are presently used 
as high-efficiency light sources for optoisolators.’”* Bias tests of these 
devices, accelerated by applying a thermal stress, demonstrated high 
reliability; with 30 mA forward current bias, a mean-time-to-failure of 
~10° h at 250°C and ~10° h at 25°C was determined. The EL pattern 
of degraded diodes appears to differ from that of unaged devices only 
in relative brightness. Both degraded and undegraded LEDs exhibit 
uniform light emission; no dark lines or spots are observed. Recently, 
however, aging studies have shown that some LEDs degrade rapidly 
when aged at 200°C even without current bias; degradation to 10 
‘percent of initial efficiency occurred within 400 h. Examination of the 
EL image of many of the most heavily degraded devices showed the 
presence of (110) oriented pLps. Thus, the drop in LED efficiency is 
attributed to nonradiative recombination at the DLDs. Since dark line 
formation has not been previously observed in graded bandgap Ga- 
AlAs:Si LEDs, TCL was used to investigate the origin and growth 
behavior of the DLDs. 

To determine the source of the DLDs, the degraded LEDs were first 
visually inspected on the headers using the SEM and Nomarski inter- 
ference microscopy. Some damage was found, but it was considered 
neither unusual nor excessive. After demounting the LEDs from the 
headers, small pyramids of various sizes were found on the p-surface 
of some of the LEDs. 

Figure 23a is an SE image of an unaged LED with several pyramids. 





50 um 50 um 


(a) (b) 


Fig. 23—(a) Secondary electron image of the p-surface of an unaged LED with 
pyramids. The sides of the pyramids are parallel to the (110) directions. 
(b) Transmission cathodoluminesence image of the region shown in (a). The dark area 
surrounding each pyramid is a dislocation network. The short dark lines are dislocations 
extending along the (110) directions. 
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(a) (b) 


Fig. 24—(a) Secondary electron image of the p-surface of a degraded LED with 
pyramids. (b) image of region shown in (a). Dark line defects oriented along (110) 
direction initiate at the pyramids. 


X-ray microanalysis on the SEM showed that these pyramids are 
composed mainly of silicon. Figure 23b is the TcL image of the region 
shown in Fig. 23a. The pyramids appear dark because of the low 
luminescence efficiency of these regions. In addition to the dark 
regions, dark lines extending from the pyramids along the (110) 
directions are observed. These dark regions and lines are part of a 
dislocation network and have no corresponding features in the SE 
image, Fig. 23a. These dislocation networks are probably a result of 
lattice mismatch between the GaAlAs epilayer and the Si pyramids. 
Figures 24a and 24b are, respectively, the SE and TCL images of the 
p-surface of typical degraded, bonded LEDs. Although Figs. 23 and 24 
correspond to two different LEDs, comparison of these figures show 
clearly that with aging, the dislocation networks originating at the 
pyramids enlarge along the (110) directions and form (110) DLDs. 
The above study was carried out exclusively on the p-surface because 
of the presence of pyramids. To determine whether the DLDs originate 
only on the p-surface, the top n-surface and {100} sides of the LEDs 
need to be examined. In unaged devices, the usual dislocations, stack- 
ing faults, and processing damage were found on these surfaces. In 
degraded devices, irrespective of the degree of degradation, TCL images 
of the n-surface were similar to those of unaged LEDs, but (110) DLDs 
were found on the {100} side faces. By examining the {100} planes of 
partially and highly degraded LEDs, the DLDs were found to initiate at 
the p-surface and propagate towards the p-n junction; the DLDs did 
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not extend beyond the p-n junction into the n-region even in the most 
heavily degraded LEDs (n/n. < 0.01). 

Figures 25a and 25b show the TCL image of the {100} side of a 
partially (n/n. ~ 0.6) and heavily degraded (n/n. ~ 0.1) LED, respec- 
tively. The LED in Fig. 25a did not show DLDs in the EL image, whereas 
several DLDs were apparent in the EL image of the LED in Fig. 25b. 
The explanation of the EL images is evident by comparing Figs. 25a 
and 25b. In both figures, the p-n junction appears as a dark line since 
the built-in electric field separates the e-h pairs generated by the 
electron beam and prevents recombination. The n-layer is brighter 
than the p-layer because of higher intrinsic luminescence efficiency 
rather than a result of degradation. In Fig. 25a, the DLDs initiate at the 
p-surface and extend up the sides. The DLDs intersect the {100} p- 
surface at an angle of 45 degrees, suggesting that the DLDs lie on the 
{111} slip planes. Dark line defects do not appear in the EL image since 
they have not propagated to the p-n junction where the EL image is 
generated. In Fig. 25b, the DLDs propagate up to the p-n junction but 
do not penetrate into the n-layer. Thus, DLDs appear in the EL image, 
but not in a TCL or EBIc scan of the n-layer. Figure 25 also demonstrates 
the ineffectiveness of EBIC in obtaining similar images. Electron beam- 
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Fig. 25—(a) Transmission cathodoluminesence image of (100) side of a degraded 
LED. Dark line defects oriented along (110) direction initiate at the p-surface and 
propagate along the (111) planes toward the p-n junction. (b) Transmission cathodo- 
luminesence image of (100) side of a degraded LED. (110) DLDs terminate at the p-n 
junction. 
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induced current images the region approximately twice the minority 
carrier diffusion length on either the p or n region. However, as shown 
in Fig. 25, the pLDs form far from the p-n junction. Additionally, the 
EBIC signal varies dramatically at the p-n junction. The small pertur- 
bations on the EBIC signal produced by the DLDs are hard to image on 
such a strongly varying background signal. 

The DLDs stop their growth upon reaching the p-n junction probably 
because of lower stress in the n-layer. Since the etch used to remove 
the saw damage during die separation removes more of the n-layer 
than the p-layer, the n-layer may have a lower surface damage-induced 
stress as found for GaP LEDs. A high internal electric field at the p-n 
junction is unlikely to hinder the propagation of the DLDs. Similar 
stress induced (110) DLDs in GaAs/GaAlAs wafers have been shown 
to propagate through the p-n junction.” Further study, e.g., an exam- 
ination of degraded LEDs using TEM or a reliability study of LEDs that 
have been etched more uniformly, is required to determine the exact 
cause of this effect. 

As a result of this TCL study, wafers with Si pyramids evident in a 
visual inspection are rejected prior to processing since the wafers 
produce unreliable LEDs. 


VI. SUMMARY 


We have demonstrated a new SEM imaging technique which we have 
called transmission cathodoluminescence (TCL). Defects within the e- 
h excitation volume are imaged as in the familiar cL mode but with a 
much higher collection efficiency and, thus, with a much higher detec- 
tion sensitivity. In addition, this technique can be used to image 
defects within the sample volume that changes the optical properties 
of the materials; these defects cannot be imaged with the CL mode. We 
have demonstrated the TcL technique on a variety of defects in a 
variety of semiconducting materials used in optical communications. 
Transmission cathodoluminesence can be used to screen out low- 
quality substrates and device wafers to save processing time and cost, 
provide rapid evaluation of growth and processing procedures, and 
evaluate degraded devices where other defect revealing techniques are 
either unsuitable or very difficult. 
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Detecting the Occurrence of an Event by FM 
Through Noise 


By V. E. BENES 
(Manuscript received March 26, 1981) 


The occurrence of an event at a random time 7 is signaled through 
white noise by an FM signal whose modulation h(t — 7) is a causal 
pulse triggered at r. Nonlinear filtering is used to find exact expres- 
sions for the chance that t > t, and the expectation of t, each 
conditioned on the observed noisy FM signal over (0, t). The former 
quantity can be used to minimize the probability of error in guess- 
ing—from the observations over (0, t)—whether 7 has occurred by t. 


1. INTRODUCTION 


The theory of frequency modulation has always been beset by 
analytical difficulties, and nowhere have these been more in evidence 
than in the area of optimal demodulation of noisy FM signals. Recent 
advances in nonlinear filtering, however, make it possible to solve 
certain problems of detection and estimation quite explicitly. We 
report on such a class of problems here. 

The basic problem setup is this: an event of interest occurs at a 
random time +. Its occurrence is signaled by sending a pulse of shape 
h(-), starting at 7; that is, we send 


0 t<7T 
ate) me t=r’ 


where /(-) is some causal, integrable pulse. The signal s(t) is trans- 
mitted by FM; the waveform is 


cos| 0 ote | sca 
0 


for a carrier frequency w and initial phase @. In transmission this wave 
suffers the degradation of having white noise added to it; thus, we 
observe a signal y, defined by 
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t 


dy, = cos + wi + | 


0 


su) [at + db,, 


with 6,a Brownian motion independent of 7. We would like to construct 
a nonlinear filter acting causally on ¥; to estimate optimally at each 
time ¢t whether 7 < ¢ or not, and if so, by how much. This filter will be 
obtained by solving the nonlinear filtering problem of determining the 
conditional probability 
pot) = P{r>tly,0Ssst} 
and the conditional density (u = distance back from f¢ to 7) 
pilt, u) = P{red(t—u)|y, OS sSt}, OSuSt. 


Such a filter (po, pi) represents a summary, without loss, of all the 
information in the “past” of{y;, 0 = s = t} that is relevant to whether 
T occurred by time ¢, and if so, how far back. In particular, the filter 
(po, Pi) yields least-squares estimates of 7, by integration over u, 
according to the formula 


uf (u)du : 


— t — 
E{r|y,0Ssst} = Pol) a + (t —u)pi(t, u)du, 


where F is the a priori distribution of 7, and f = F’ its density. The 
first term predicts where 7 will be, on the average, when it has not yet 
occurred by #; the second “postdicts” + when it has already happened 
by time ¢. Indeed, the first term is E{71,5:| ys, 0 < s = ¢} and the second 
is E{r1,<:| ¥s,0 Ss t}. 


ll. NOTATIONS 
Let x; be the process 1, so that 


_}0 if the event has not occurred by time ¢. 
: 1 if the event occurred by time ¢. 


Then with X; = f6 x.ds, the signal s(t) can be written as 


a ere _{H(X%) t= 
st) = | ae syd. = {4 ee 


and the FM signal as 

cos[@ + wt + H(X)], 
where H = fo h(s)ds. 
Ill. FILTERING EQUATIONS 


Our approach is Bayesian: foreknowledge of distr {7} is used to 
calculate the conditional probabilities po and p;. We assume for sim- 
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plicity, and with only slight loss of generality, that + has a known a 
priori distribution F with a differentiable density f. The “rate” at 
which 7+ is occurring during (¢, ¢ + A), given that it has not yet 
happened, is just 

f(t) 
——,, > 0. 
1—- F(t) 
We assume at first that the phase @ is known at the receiver. Then, the 
filtering or Zakai equations for unnormalized versions po and p: of Do 
and pi, respectively, are just 


dpo = A(t)podt + cos(@ + wt)pody: , 


A(t) = 


0 
dp: = ~ + cos[@ + wt + H(u)]oidy:, 
with initial conditions po(0) = 1, 
t 
lim | pi(t, u)du = 0 
“10 Jy 


and boundary condition pi(t, 0) = po(t)A(t). 

It can be seen that since the process ; is transient in character these 
equations are coupled in one direction only: p: depends on po via the 
boundary condition, but po in no way depends on p;. Thus, it will be 
possible to solve for po first, and then for p;. We first transform the 
problem into one without stochastic differentials. This is done by the 
now familiar device’ of looking for a solution of the form 


po(t) = exp[ ycos(4 + wt) |qo(é) 
pil(t, u) = exp{y.cos[@ + wt + H(u)|}qilt,u), OStsu, 


where go and q; are differentiable functions, though not necessarily 
C’. This form for po and p; indicates that the rough or martingale 
dependence of these functions on y(-) is confined to the exponent as 
shown, while their dependence on y(-) via go and qi is of a much 
smoother integrated form, as will be seen. 

Applying Ito’s formula to the postulated form, with quadratic vari- 
ation d(y); = dt since the observation process is a translation of the 
Wiener process, we find these nonstochastic PDEs for go and qi: 


1 
do = qo (-r0 + wysin(@ + wt) — 5 cos*(6 + wt), go(0) = 1 


a ie a +H (>t + h(u)]sin[9 + wt + H(u)] 


— 5 cos'{@ + wt + Hu)}), Ve wat, 
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The boundary condition is 
qi(é, 0) = go(t)A(é) . 
The first equation is an ODE solvable as 


go(t) = exp( - | A(s)ds + | [wyssin(@ + ws) — ; cos?(6 + vs)]ds) 
0 0 


t 
=f{1-— F¢oexe( | [wy.sin(@ + ws) — ; cos?(6 + ws)]d). 
0 
The second is a first-order PDE solvable by characteristics as 


qit, u) = A(t — wero | {wysin[@ + ws + H(s — t+ u)] 


- ; cos"[0 + ws + H(s—t+u)] 
+ yA(s — t+ u)sin[é + ws + H(s —t+ wiids), 


where A(-) is an arbitrary function. To obtain A we let u]0, and we 
use h(s —t+u) =O and H(s —t+u) =0fors<t-—uto find 


t 
q(t, 0) = Atexe( | [yswsin(@ + ws) — ; cos?(6 + vs), 
0 


by the boundary condition. Thus, A(t) = f(t), and we obtain 


t-u 


gilt, u) = f(t — wexo( { [wyssin(@ + ws) — 5 cos’(9 + ws)]ds 


0 


+| {[w + h(s — t+ u)]y.sin[é@ + ws + H(s —t + u)] 


ee 5 cos’([@ + ws + H(s —t + w))ds). 


We remark that this is the unconditional density f(t — u) that r+ 
occur at ¢ — wu, multiplied by a positive factor depending on the pulse 
shape h and the observation {y;, 0 = s < t}. The cos’ integral can be 
evaluated explicitly, leading to some simplification, and to approximate 
formulas for large carrier frequencies.” 

The normalization 


Polt) + | pilt, u)du = 1 
0 
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is achieved by dividing each of po and p, by 
g 
po(t) + | pilt, u)du, 
0 
where 


po(t) = [1 - F ()Jexn( sos + wt) + | [wy.sin(@ + ws) 
0 


— ; cos?(9 + vs)]ds) 


pilt, u) = f(t— wexp] eos + wt + H(u)] 


t—u 
1 
+ | [wy,sin(@ + ws) — 5 cos?(6 + ws) ]ds 
0 
t 
+ i te +h(s—t+u)]ysin[@ + ws + H(s —t+u)] 
t—u 


= ; cos"[9 + ws + H(s —t + wn) ds : 


If, as is likely, the phase @ is not known at the receiver, then it must 
be integrated out in both po and p; prior to normalization, a process 
that mars the relatively neat formulas obtained for po and p; for 6 
known. With @ uniform over (—7z, 7) and independent of 7, familiar 
Bessel function approximations again arise.” 


IV. HAS 7 OCCURRED YET? THE OPTIMAL GUESS 


In the kind of system under study here, a task of primary interest is 
to guess at ¢ whether 7 has happened yet. Such a guess is represented 
mathematically by a random process u;, taking the value 1 for a 
decision that 7+ has not occurred, and a value 0 for a decision that it 
has, and adapted to the past observations p{y,, 0 <= s = t}. The 
probability of error is just 


P{irst&u=13 + P{r>t &u,=9}, 
which can be written as 
E1,s(1 — v,) + Ev(1 — 1,52) 
= Elis: — 2EL sw; + Ev; 
= E(1s — uy)’, 


the mean square error in approximating 1,,,; by v;. Thus, the chance of 
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error is the least if uv; is chosen to minimize this mean square error. 
Noting that po(t) = E{1s:|ys, 0 = s = t}, we can write this mean 
square error as 


E{ po(t) — 2po(t)ur + vz} 
and conclude that a minimizing v,; is 


1 


v(i) = 1 
0! if po(t) Ss 3 


It follows that by watching po(-) we can make a best guess as to 
whether 7 has occurred yet or not, best in the sense of minimizing the 
chance of being wrong. 


V. THE CONDITIONAL EXPECTATION OF 7 


As we observe the signal ¥;, we may be interested in predicting 7 on 
the basis of the information seen so far. More precisely, since it is 
possible that at ¢ > 0 7 has already occurred, we want to simultaneously 
predict and “postdict” 7 by calculating the two terms in 


T=E{r|y,0Ss 6} 
= E{tlsilys,OSsst} + E{rla|y,0S 55 t}. 


The second term is clearly given in terms of p; by 


t 
| (¢ — u)pi(t, u)du, (u = distance back to 7 from f£) . 
0 


We claim that the first is just 


| udF (u) 


“1- F(t) * 
For with o{ ys, 0 = s < t} = Yo for short, we have 
E{rl,>¢| yo} = E{r1,5:| your > t} | yo} 
= E{E{r| your > t}1,>+| y6} 
= E{E{r|7 > t}1,52| yo} 
= polt)E{t|7 > t} 


Dolt) 


since the additional y, information in Y6 U 7 > fis irrelevant to r when 
it is known that 7 > ¢. That is, since x; = 1,5, is a Markov process, all 


2232 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


the information o{x;, ys,0< s St} is irrelevant to {x,, u >t} when it 
is known that x; = l,ie., 7 >. 
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Fault-Simulation Methods—Extensions and 
Comparison 


By Y. H. LEVENDEL and P. R. MENON 
(Manuscript received December 16, 1980) 


In this paper, we compare four different methods of fault simulation 
in terms of their handling of arbitrary numbers of logic values, 
modeling levels, and detailed timing. The methods considered are 
parallel, deductive, multilist, and concurrent simulation methods. 
Since some of the methods, in their current forms, are unable to 
handle all the problems under consideration, we have proposed 
extensions to the methods wherever necessary before making the 
comparisons. While all the methods considered are capable of solving 
the problems with the same degree of accuracy, the concurrent sim- 
ulation method appears to be the simplest and most flexible. 


1. INTRODUCTION 


Different techniques for the efficient simulation of faults in digital 
circuits have been published. Among these, the best known are parallel 
simulation,!~? deductive simulation,’ and concurrent simulation.° A few 
papers analyzing some aspects of these methods have also been pub- 
lished.*° 

This paper and two others’ comprise a series attempting a com- 
prehensive analysis of fault simulation methods. It is hoped that they 
will provide a basis for the selection of fault-simulation methods to 
satisfy specific requirements. 

In this paper, we consider three aspects of circuit modeling and their 
effects on the fault-simulation method used. First, we consider the 
number of logic values needed to accurately model logic devices and 
its impact on the simulation method. Next, the effectiveness of the 
different methods for simulating at different levels (e.g., gate level, 
functional level, subsystem level, etc.) is considered. Finally, we discuss 
the modeling of timing effects, such as rise and fall times and high- 
frequency rejection. 

Our study covers four methods of fault simulation: parallel, deduc- 
tive, multilist, and concurrent. In their current forms, some of the 
methods are not capable of handling all the problems we consider. 


10,11 
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Therefore, we have attempted to extend the existing methods, wher- 
ever necessary, before making the comparisons between methods. 
Before proceeding to the analysis of the methods, we present a brief 
description of each method. 

Historically, parallel simulation was the first method that simulated 
a number of faults simultaneously.’ This method, which is perhaps the 
most widely used, takes advantage of word-oriented operations in the 
host computer and packs together several faulty circuit values into one 
or more computer words. Although this method is quite efficient, 
multiple passes are required for simulating large numbers of faults. 

Deductive simulation attempts to eliminate the need for multiple 
passes by computing normal signal values in the circuit and deducing 
the faulty values by manipulating lists of faults.* Associated with each 
signal is a fault list, which is a set of faults, any one of which will cause 
the signal value to be different from the normal value. The effects of 
faults are propagated through the circuit by an algebra of sets. 

The multilist method associates two or more lists of faults with each 
signal.’*’? Conceptually, the number of lists associated with a signal is 
equal to the number of logic values simulated. Thus, for two logic 
values, there will be a 0-list and a 1-list associated with each signal, 
the former being the set of faults in whose presence (individually) the 
signal will have the value 0, and the latter those that result in a value 
of 1. Set algebra is necessary for manipulating these lists also. However, 
unlike the deductive method, the equations for computing the output 
lists of a device from its input lists are dependent only on the function 
performed by the device and not on the signal values. 

In concurrent simulation, any fault that causes the inputs, outputs, 
or internal state of a device to be different from their normal values is 
represented conceptually by a copy of the device. During simulation, 
if the inputs, outputs, and state of a faulty copy become identical to 
those of the fault-free copy, the faulty copy is deleted. Thus, faulty 
copies are created and deleted during simulation. The evaluation of 
faulty copies is essentially the same as fault-free copies, and no set 
algebra is involved. Concurrent simulation can also handle a large 
number of faults simultaneously. 

It is interesting to note that all the above methods, except parallel 
simulation, use some form of data compression for storing faulty signal 
values. On the other hand, parallel simulation attempts to compute 
simultaneously the fault-free signal value and a number of faulty signal 
values associated with each lead in the circuit. 


ll. NUMBER OF LOGIC VALUES 


Three-valued logic systems have been widely used for analyzing 
essentially binary systems.’*“ Three logic values are also used in logic 
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simulation, where 0 and 1 represent the two discrete values being 
modeled and a third value, u, denotes that a particular value is 
unknown. 

Recently, tri-state busing has become a widespread technique used 
in many LSI designs. Difficulties in modeling effects associated with 
cmos technology have been reported.” One effect is the memory 
associated with a disabled bus. That is, the disabled bus remembers 
the previous logic value on the bus. A solution consists of adding 
special circuitry to regular gates, making possible the use of a simulator 
with only three logic values.” An alternate solution is the addition of 
three more logic values, namely Zo, 21, and z, for representing the states 
of disabled buses, with previous value equal to 0, 1, and unknown, 
respectively.’© Transistor-transistor logic (TTL) tri-state technology 
requires the addition of only one logic value, z.’® 

Bus contention, another typical, potentially destructive tri-state 
effect, cannot be modeled by added circuitry. A solution consists of 
adding one more logic value representing a conflict state, a, as shown 
in the following example. 

Consider a driver inverter and a bus configuration in TTL tri-state 
technology (Fig. 1). When line e is enabled, the gate operates as an 
inverter, when e is disabled the output of the gate is in a high- 
impedance state. When used in a bus configuration, two enabled 
inverters create a conflict (bus contention), if they are in opposite 
states. The set of logic values {0, 1, u, a, z} is sufficient to model these 
effects, since tri-state devices in TTL technology do not have the 
memory property mentioned above. 

Table I shows how the bus configuration of Fig. 1 can be simulated 
using the above set of five logic values. Since the bus will be connected 
to the output of drivers, which can produce four out of the five logic 
values, only four logic values are used for modeling the bus. 


ey 


(a) = 


(b) 
Fig. 1—(a) TTL Driver-inverter. (b) Bus configuration. 


FAULT SIMULATION 2237 


Table I—(a) Tristate inverter output 
(b) State of tristate bus 





If an ordinary gate could be connected directly to a bus, the model 
should allow five logic values for the gate inputs, but requires only 
three logic values for its output. Table II shows the behavior of such 
an AND gate with inputs x and y, and output ¢. 

The use of larger sets of logic values, though necessary to correctly 
model modern technology, has a serious impact on the method of 
simulation used. The following sections deal with this problem. 


2.1 Parallel simulation 


When using a switching algebra (i.e., two logic values) parallel 
simulation can be implemented by associating one computer word 
with each line in the circuit. One bit of this word represents the signal 
value on a line in the fault-free circuit and the remaining bits represent 
values on the same line in the presence of different single faults. 


Table II—AND gate with five input 
logic values 
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Table III—Coding for three logic 
values 


unknown 


0 
unused 





When a three-valued system is used, each of the circuits simulated 
in parallel must be coded using two binary digits. A commonly used 
method consists of associating two words with each line a, namely the 
0-word, a°, and the 1-word, a’.’” The coding used is shown in Table III, 
where the subscript i refers to the ith bit of each word. 

Examples of its use are shown in Fig. 2. Here, and elsewhere in this 
paper, lower-case roman letters are used to denote leads and Greek 
letters represent words. For the gates of Fig. 2, we have 


Veo 
y=a'-B 

d° = a B° 

3 =a'+ p 
=a 

(é =a’, 


where - and + represent the bitwise AND and OR operations on 
complete words. 

This method can be extended for any number of logic values. For 
instance, consider the AND gate of Fig. 3, using the set of logic values 
{0, 1, u, a, z}. The binary coding scheme requires three computer 
words for each line, and three codes (out of eight) are not used. For 


5°. 


ofa, afer 
i ae 


NOT 


Fig. 2—Use of coding to represent signal values on gates. 
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Fig. 3—AND gate representation for five logic values. 


any choice of code, it is possible to calculate the gate output from 
switching expressions of the following form: 


y~ = f(a’, a’, a’, ig B’, B’) 

y' as 2(a°, a’, a’, B°, B’, B’) 

y _ h(a, a’, a’, B. B', B”). 
The original set of logic values and operations do not constitute a 
Boolean algebra. The coding scheme establishes a mapping of non- 
Boolean functions into switching operations that can be applied on full 
computer words, thus, allowing parallel processing. 

By using a coding of n — 1 variables to represent n logic values, it is 
possible to obtain simpler equations for computing the outputs of 
gates. For example, consider the gate of Fig. 3 and a coding using four 
words a°, a’, a“, and a’ to represent five logic values. The code is such 
that a? = 1, 7 = 0, 1, a, or 2, iff a; = 7. All the variables will be zero, if 
and only if a; = u. With this coding, the following equations are 
obtained for the gate of Fig. 3: 


y =a’ + p° 
Y= ap 
y=0 
y=0. 


This type of coding can be used for any number of logic values. 


2.2 Multilist simulation 


It has been shown that for a three-valued logic system, three lists 
X°, X1, and X“ can be associated with each line x.’*”” Each list, X‘, 
(i = 0, 1, uw) represents the faults which cause line x to have the value 
i. For each line x, all the lists X’ are disjoint and any list is the 
complement of the union of the other two (i.e., the union of the three 
lists is the set of all faults being simulated). 

For the gates of Fig. 2, we have 
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C=ANB yao 
lolaue i 
D=AUB 1, a 
eae D'= DUD 
i 0 —— 


where ~, U, and NM, are set complement, union, and intersection, 
respectively. 

When five logic values are used, we need five lists; for instance, A°, 
A’, A”, A%, and A’ are associated with line a. 

For the AND gate of Fig. 3, we have 


C° = A°U B® 
C'=A'N B' 
Cat. 3 
C=t 3 


C’=(PUCIUCUCY =A°UBUA'NB'. 


For the inverter of Fig. 1, we have 


B=E'N A! 

B}=E'n A° 

B? = E° 

BY = E“U E*U E7U (E'N (A°U Ad) 
Be={ }, 


and for the bus configuration of Fig. 1 

A$ = (BY N B%) U (BY N Bs) U (B2 N Bi) 
A} = (Bi N B2) U (Bi N B3) U (B2N Bi) 
Ag = BY U Bs 

A3 = BIN Bs 

A§ = (B2N Bi) U (B2N B)). 


This method can be generalized to any gate type and any number of 
logic values as follows: Let us assume that we wish to simulate a 
function f(x1, X2,-++, Xn), where each input and the output may 
assume any one of k values, denoted by 1, 2,---, k, and that the 
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function is defined by a table which specifies the values of f for all 
combinations of values of xj. 

_ (t) We associate a variable x? with each variable x;, such that 
x? = 1if and only if x; = /, 1 < 7 = k. Similarly, we associate k variables 
f? with f. 

(it) For each i, 1 Si S k, we obtain an expression 
f = > P D 


where P; are products of literals x7’, representing all combinations of 
values for which f = i. For example, if the table has an entry 


H=1 %.=0,%,=2, f=1, 
the expression for f' will contain the term 
X1 XIX. 


(iii) Replace all lower-case letters in the equation for f' by the 
corresponding upper-case letters, representing lists, and retain the 
superscripts and subscripts. Replace products by intersection and sums 
by union. 


2.3 Deductive simulation 


Deductive simulation is well defined for two logic values, and is also 
applicable to three logic values with some loss of information.‘ Specif- 
ically, if the signal value in the normal circuit is known, (i.e., 0 or 1), 
but the value in the presence of a fault a is unknown (denoted by uw), 
the fault a is included in the fault list as a star fault;*”® that is, it is 
unknown whether the particular signal value in the presence of the 
fault a will be different from the fault-free value. It was shown in Refs. 
10 and 12 that there are cases where the circuit value in the normal 
circuit may be unknown, but the value in the presence of a fault may 
be known. Since the deductive method cannot represent this case, the 
results obtained may be less accurate than with other methods.’”” 

A modification of the deductive method that leads to accurate three- 
valued fault simulation was presented in Ref. 10. It uses the coding of 
Table III for representing each signal value by a pair of binary 
variables. A pair of equations can then be derived, as in Section 2.1, 
for computing the coded outputs for each gate type. These equations 
can be viewed as defining a transformation of the original circuit with 
three signal values into two circuits that will have only binary signals. 
These two circuits can be simulated using the two-valued deductive 
method. The fault-free and faulty signal values on any lead in the 
original circuit can be determined from the signal values and fault lists 
associated with the corresponding pair of leads in the transformed 
circuits. 
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Fig. 4—Tristate bus driver. 


The same approach can be used for performing deductive fault 
simulation with any number of logic values. If k logic values are to be 
simulated, [logsk] binary variables will be used to represent them, 
where [x] denotes the smallest integer greater than or equal to x. The 
equations for the coded outputs of different gate types can be derived 
from their truth tables, and used in deductive simulation. 

As as example, consider the bus driver of Fig. 4 to be simulated with 
four logic values, namely, 0, 1, z (high impedance) and u (unknown). 
The behavior of the device is specified in Table IV. 

Using the coding of Table V, we shall represent the signals a, e, and 
b of the bus driver by ap and aj, eo and e, and bo and b;. The output 
equations bo and b, can be derived from Tables IV and V. 


bo = €0°@1 + Ao- 0° ei 
by = €o-€) + A\-€° ey 


For any combination of input values and fault lists, the output values 
and fault lists can be computed as in Ref. 19. 

Denoting the fault list associated with each variable by the corre- 
sponding upper-case letter, let the input values and fault lists for the 
circuit of Fig. 4 be as follows: 


ao =0; Ao = {1, 3} 
a=1; Ai = {3} 

€o =0; Eo = {2, 4} 
€e=1; E, = {4, 5} 


Table IV—Behavior of bus 
driver 
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Table V—Coding for tristate devices 





Let us assume that all the faults being considered are external to the 
device, and we wish to propagate the effects of the faults through the 
device. The input conditions are: a = 1, e = 1. Since the fault 1 is 
contained only in the fault list Ao, it will cause a to become 1, and 
therefore a to become z. On the other hand, fault 3 is contained both 
in Ao and Aj, and will cause both ap and a; to be inverted; the value of 
a in the presence of fault 3 will be 0. Similarly, fault 2 will result in 
e = 2, fault 4 in e = 0, and fault 5 ine = u. 

For the above set of values, the output values and fault lists can be 
computed using the equations for bo and 6; and the method presented 
in Ref. 19 as follows: 


by = 0 
b=1 
Bo = (Ao N Eo N Ey) U (Eo N E1) = (1, 3, 4} 
B, = (Ai U Eo U Ey) N (Eo E)) = {2, 3, 5}. 


Denoting the value of b in the presence of fault a by b(a), we can 
obtain the following faulty values from the values 0 and 6; and fault 
lists Bo and By}. 


6(1) =z; 6(2)=u; 6(3)=0; 6(4)=2; 6(5) =u. 


These can be verified by computing the output for each faulty com- 
bination of inputs using Table IV. 

The modified deductive method discussed above does not lose any 
information about the normal and faulty circuits and is as accurate as 
any of the other methods. It requires only [log27] lists compared to the 
n lists needed for multilist simulation. However, fault list computations 
depend on signal values and may be more complex than in the multilist 
method. 


2.4 Concurrent simulation 


There is no limitation on the number of logic values in this simula- 
tion method since faulty and fault-free circuits are treated independ- 
ently. As long as the primitive elements of the circuit are well defined, 
the evaluation of faulty circuits presents no difficulty. 
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2.5 Summary of results 


The results of Section II are represented in Table VI. 

Deductive simulation with three logic values (indicated by * in 
Table VI) requires the introduction of the concept of star faults. 
Deductive fault simulation for more than three logic values (indicated 
by + in Table VI) could be defined by using a transformed circuit as 
proposed in Section 2.3. However, the complexity of such a procedure 
does not seem to justify its use. 

From the point of view of simulating more than three logic values, 
concurrent simulation represents the simplest, most flexible simulation 
method. 


Ill. MODELING LEVELS 


Three levels of modeling and their effects on the simulation method 
used will be considered: gates, higher-level primitives, and user-defined 
functions. 


3.1 Gate-level simulation 


All the fault-simulation methods presented here were initially de- 
veloped for simulating circuits modeled at the gate level. Therefore, 
none of the methods presents any problem, provided only two (or 
three) logic values are to be simulated. The differences due to the 
number of logic values needed have already been discussed in Section 
II, and the effects of detailed timing analysis are discussed in Section 
IV. 


3.2 High-level primitives 


It is often convenient to model devices such as flip-flops, multiplex- 
ors, counters, and shift registers as high-level primitives rather than as 
interconnections of gates. For purposes of simulation, such devices 
may be described by tables, Boolean equations, or algorithms. The 


Table VI—Summary of Results: logic values 


Prat 


Switching One word per | Two lists Well defined Well defined 
algebra line per line 


Three logic Two words Three lists | Well defined but Well defined 
values per line per line pessimistic* 











Five logic Three words Five lists Undefinedt Well defined 
values per line per line 





n logic [logon] words | n lists per Undefined} Well defined 
values per line line 





* Deductive simulation with three logic values. 
+ Deductive fault simulation for more than three logic values. 
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type of representation that is most convenient to use will usually 
depend on the simulation method. 


3.2.1 Parallel simulation 


Several solutions are possible. The input values for individual faults 
can be determined from the input word(s), and the outputs of the high- 
level primitive can be evaluated for each case. The output values must 
then be packed so that the parallel simulation method may be used 
elsewhere. While this approach may be satisfactory for predominantly 
gate-level circuits which also contain a few high-level primitives, the 
overhead associated with converting to single-fault simulation and 
back to parallel simulation may not be acceptable. 

When only two logic values are involved, the primitive can be 
represented by Boolean (switching) expressions. The operators in 
these expressions can be treated exactly like gates in parallel simulation 
of gate-level circuits. When more than two logic values are simulated, 
the description of the primitive may be in the form of tables. Using a 
coding of the type described in Section II, switching algebraic expres- 
sions for the coded output words (i.e., the 0-word, 1-word, etc.) can be 
obtained in terms of the coded words associated with the inputs and 
state variables. These equations can then be used to compute the 
coded output words. 


3.2.2 Multilist simulation 


The function realized by a high-level primitive can be represented 
by tables. From these tables, equations for the output lists in terms of 
lists associated with inputs and state variables can be obtained as 
discussed in Section 2.2 and used for simulation. 


3.2.3 Deductive simulation 


As in the case of parallel simulation, one approach is to simulate 
each high-level primitive for one fault at a time and use the results to 
construct output fault lists for use outside the primitive. Alternatively, 
outputs and the next state values of state variables may be represented 
by Boolean equations, which are used for fault-list computations in 
the same manner as gates. When more than two logic values are to be 
simulated, a binary coding can be used as discussed in Section 2.3, and 
equations for each coded bit can be used for fault-list computations. 

Another possibility is to use tables that specify the fault-list com- 
putations for every combination of input values.* These tables can be 
constructed from the tables specifying the primitive, as shown by the 
example of Fig. 5. 

The behavior of the function is represented in Table VII, where py, 
qi are the initial states of the flip-flops and pe, g2 are the next states. 
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Fig. 5—NAND sr latch. 


Table Vil—Behavior of SR latch 





1 0 1 0 0 1 1 
2 0 1 0 1 1 0 
3 0 1 1 0 0 1 
4 0 1 1 1 0 1 
5 1 0 0 0 1 1 
6 1 0 0 1 1 0 
7 1 0 1 0 0 1 
8 1 0 1 1 1 0 
9 1 1 0 0 1 1 
10 1 1 0 1 1 0 
11 1 1 1 0 0 1 
12 1 1 1 1 u u 


The table does not include 00 as an initial state because it cannot be 
produced directly. 

The fault-list propagation is summarized in Table VIII and does not 
include local faults. P;, Qi, R, S, Po, and Q2 are the fault lists associated 
with pi, gi, 7, 8, P2, and qe. The star faults in the table are to be added 
to both the fault lists, P2 and Q@2. We shall demonstrate the procedure 
used for deriving Table VIII, by showing how line 1 of the table was 
obtained. 

Consider line 1 of Table VII. To get a change in p2, we need a change 
into lines 3, 4, 7, or 11 and for a change in g2, we need a change into 
lines 2, 6, 8, or 10, which produces: 


P, = P,Q,RS U P,QiRS U PiQ.RS U PiQ,RS U P,Qi(®) 
=RSUP,R 

Q. = P,Q,RS U PiQ:RS U P\Q,RS U P,Q,RS U P,Q:(®) 
= RSU QS, 


where juxtaposition represents intersection. Since a change to line 12 
is needed to cause pe and g2 to become unknown, the star faults are 
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Table Vill—Fault-list equations for SR latch 


AMSA UM kh wD 


oo 








given by 


PiQRS. 
We have used P,Q,, which corresponds to the initial state 00 in the 
faulty circuit, as a don’t-care state (®) to simplify the expressions. 


3.2.4 Concurrent simulation 


In concurrent simulation, the same method is used to evaluate fault- 
free and faulty circuit signal values. Therefore, no transformations of 
representation are necessary, and any representation that leads to 
efficient simulation may be chosen. 


3.3 User-defined functions 


Our discussion of Section 3.2 also applies to user-defined functions. 
The main difference is that the tables or equations used for represent- 
ing the functions must be generated from descriptions in a high-level 
language such as the function definition language in LAMP.” 

A typical construct in such a language is the cause-effect statement. 
Such statements can be nested to many levels. The techniques dis- 
cussed in Section 3.2 can be used for simulating user-defined functions 
using the parallel, multilist, or deductive method by first replacing 
cause-effect statements by equivalent equations. For example, the 
statement 


ifx then z=a else z=) 
can be replaced by 
z=ax+b-x+a-b. 
The redundant term a-b has been introduced to produce the correct 
result z = 1 for the case a = b = 1 and x = u.”! Otherwise, the 


pessimistic result z = u will be produced for this case. 
Concurrent simulation does not require the transformation of cause- 
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effect statements into equations. For each fault and each combination 
of inputs and state, only those computations enabled by the conditions 
need be performed. The operations in a function definition need not 
be restricted to logical operations. Therefore, it is not necessary to 
generate Boolean equations corresponding to arithmetic operations, as 
would be necessary in the other methods considered. Thus, it appears 
that the concurrent method would allow simulation of functions de- 
fined at a higher level than is possible with the other methods. 


3.4 Summary of results 


The results of this section are summarized in Table [X. The concur- 
rent method is clearly superior in its ability to simulate different levels 
of models. 


IV. TIMING 


In this section, we study different effects related to timing, and their 
impact on the four simulation methods under consideration. We shall 
consider the effects of different rise and fall times associated with 
signal changes, suppression of short pulses to model inertial delays, 
and the simulation of faults which affect the magnitude of delays 
associated with devices. We shall restrict our discussion to logic 
simulation with two and three logic values. 


4.1 Rise and fall times 


The delays associated with 0 to 1 and 1 to 0 transitions of a signal, 
called here the rise and fall times, are not necessarily equal.”””’ All the 
methods of fault simulation under discussion simulate a number of 
signals simultaneously, some of which may be rising and some falling. 
To simulate this effect accurately, a mechanism is necessary for 
allowing rising and falling signals to change at different times. 

Let f be a time before any change occurs on the line under consid- 
eration. Due to differences in the rise and fall times, signal changes 
may occur on the line at times ¢; and fe, where f < t; < ft. Thus, at 
time ty, all signal changes associated with the particular event would 
have occurred. The effect of different rise and fall times can be 


Table IX—Summary of results: modeling levels 


Higher level 
primitives 

















ser defined 
functions 


Note: 1 = No transformations required. 
2 = Transformation into equations required. 
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simulated accurately by computing the values of the signals at time ¢, 
based on the values at é and ¢2, namely, the initial and final values for 
the particular set of transitions. 

In the following sections, we shall consider four simulation methods 
and examine the results produced by their different models at three 
points in time, namely, before ¢,(é), between #4; and ¢é2(¢:), and after 
to(t2). 


4.1.1 Parallel simulation 


Let &, &, and & be the words associated with a line x at times fo, t), 
and f2, in two-valued parallel simulation. In three-valued simulation, 
two words denoted by superscripts 0 and 1 will be associated with the 
line for each of the above times, and the coding of Table III will be 
used. 

Case 1: Rise time < fall time. We have 


£ = fo + & 
for two-valued simulation, and 
a= f+ & 
gr = 0-8 
for three-valued simulation where + and - represent bitwise OR and 


AND performed on full words. 
Case 2: Fall time < rise time. We have 


£1 = £o-& 
for two-valued simulation, and 
fi = &-& 
él = £0 + & 
for three-valued simulation. 


The preceding formulas can be verified by checking all nine possible 
transitions between the set {0, 1, uw} and itself. 


4.1.2 Three-list method 


Let Xb, Xi, and X} be the i-lists at times fo, ti, and to, defined above, 
for 1 = 0, 1, u. Using the same arguments as in Section 4.1.1, we obtain 
the lists for time ¢, as given below. 

Case 1: Rise time < fall time. We have 


Xi=Xju Xi 
X? = XIN XS. 
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Case 2: Fall time < rise time. We have 
Xi = XIN X2 
XP = XOU X?2. 
In both cases, X{ = (XT U X). 


4.1.3 Deductive simulation 


Deductive simulation with different rise and fall times has been 
discussed by Kjelkerud and Thessen.” Here we present an alternate 
method. 

Let the times éo, t;, and f2 be as defined earlier and let x; and X; 
represent the signal values and fault lists at those times, z = 0, 1, 2. If 
the rise time is less than the fall time, all 0 to 1 transitions will occur 
at f;. Therefore, we have 


X1 = Xo + Xo. 
Similarly, if fall time < rise time, 1 to 0 transitions will occur at ¢,, and 
x, = 1 if and only if it remains at 1 throughout the transitions. 
Therefore, for this case 
X= Xo°X2. 
The fault lists X; at time ¢, for different signal changes in the fault-free 


circuit and different relative values of rise and fall times can be 
determined from these equations. They are summarized in Table X. 


4.1.4 Concurrent simulation 


In fact, concurrent simulation is a trivial case, because fault-free and 
and faulty circuits are simulated independently. Rising and falling 
edges will still occur in distinct event waves, but the treatment of these 
events is individual. 


4.2 High-frequency rejection 


High-frequency rejection consists of eliminating short pulses for 
modeling the effect of inertial delays. We consider events occurring at 


Table X—Fault-list equations for handling different rise and fall times 


Rise Time < Fall Time Fall Time < Rise Time 


Rising edge n=1 m=O _ 
Xo = 0; x2 =1 Xi = Xo Xe Xi = XoN Xe 


Falling edge xy=1_ x= 0 
X= 1; x= X1 = XoNn X2 X, = XoN Xe 


x=1 m=l1 
X; = XoN X2 1= Xo U Xe 


x,=0 m1 =0 
X, = Xo U Xe Ay = XoN Xe 





Fault-Free Circuit 


















Constant one 
Xo=1,x2=1 






















Constant zero 
Xo = 0; x2 = 0 
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times fo, f1, and f2, where to < ¢; S ¢2 and the logic values at these times. 
If t,2 — ft, is less than the magnitude of the inertial delay, then the 
change at ¢; must be rejected to suppress short pulses and the value 
between #; and fe, x1, will be replaced by a corrected logic value, x1,. If 
Xo, x1, and x2 are the computed signal values at fo, ¢:, and f2 respectively, 
and if t: — ¢, is less than the inertial delay, the corrected signal value 
at time f; is given by: 
Xin = X0X1 1 X1 Xe + XX. 

The method for performing high-frequency rejection can be derived 
from this equation. 

In case there are more than two events within the range of the 
inertial delay, the treatment elaborated above must be repeated for 
each pair of events within that range. For example, consider three 
events occurring at times @), f2, and ts. The following triples will be 
considered: (fo, ti, f2), (to, ti, ts), (ti, te, ts), which represent three pairs 
of events. 

4.2.1 Parallel simulation 

For two-valued parallel simulation, the word é, has to be replaced 
by 

fin = S081 + E182 + Soke 


and for three-valued parallel simulation, &? and é{ are replaced by 
fin = St + GG + HE 
In = SOE] + 92 + E082. 


The coding defined in Table III was used to obtain the above equations 
for three-valued parallel simulation. 


4.2.2 Three-list methods 


Using the equation for x1, given above, we obtain the following fault- 
list equations: 


X$n = (X9 NM XM) U (XIN X2) U (XO N X23) 
Xin = (XON XI) U (XIN XB) U (XON X3) 
Xt, = (XY U X}). 

4.2.3 Deductive simulation 


The deductive fault list Xi, can be obtained from the equation for 
the new signal value x, the values of xo, x1, and X2, and the associated 
fault lists. The fault list X,, can be computed in the same manner as 
fault propagation through functional blocks.’ In fact, high-frequency 
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rejection may be thought of as being performed by a filter whose 
equation is given above. 

The fault-list computations for the eight possible patterns of xo, x1, 
and x2 are summarized in Table XI. 

As an example, consider the case x» = 0, x; = 1, x2 = l(line 4 in 
Table XI). We have 


Xin = XoX1 + X1X2e + HX. =atbt+e. 


The fault lists associated with the terms a = xX0x, 6 = X,%2, and c = 
xXoX2 for the specified values are: 


A=XNX; B=X%,UX; C=XNX. 
Therefore, 
Xin = AN BOC& (Xo UX) N (XU Xo) N (Xo U Xo) 
= (Xo N Xi) U (Xo N X2) U (Ki NX). 


4.2.4 Concurrent simulation 


In this case, each faulty signal value is computed separately. There- 
fore, high-frequency rejection can be performed on each signal individ- 
ually, using the equation for x,, given in the preceding section. 


4.2.5 Suppression of short-duration detections 


We have considered the suppression of short pulses produced inde- 
pendently by each fault-free or faulty signal value. However, we did 
not consider the case of a short pulse of detection, when neither the 
faulty nor the fault-free signal incurs a pulse. This is illustrated by the 
case where Xo = 1, x) = 1, x2 = 0, for the fault-free signal and xo = 1, 
x1 = 0, x2 = 0, for the faulty signal. This causes a short detection 
between ¢, and tz. For deductive simulation, this short detection may 
be eliminated by using the formula 


Xin = (Xo NM X}) U (Xo a X2) U (Xi M X2) 
independently of the fault-free signal pattern and after the high- 


Table Xl—Fault-list equations for high-frequency rejection 
x1 X2 Xin Xin 


(Xo NX) U (XN X2) U (Xo N Xe) 
(Xo MX) U (Ki N Xe) U (Xo N Xe) 
(Xo NM Xi) U (XN X2) U (Xo N X2) 
(Xo NX) U (X1 N X2) U (Xo N Xe) 
(Xo NX) VU (XN Xe) U (Xo N Xe) 
(Xo NX) U (XN X2) U (Xo N Xo) 
(Xo N Xi) U (XN Xo) U (Xo N Xe) 
(Xo NX) U (XN Xe) VU (Xo N X) 


53 
2° 


See OOOO 
He OORF OO 
KH OrFOROrS 
BPR OHM OoOOo oO 
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frequency rejection has been performed. The term X; is the set of 
faults detected at time £j. 

The same method may be used for all the other simulation algo- 
rithms described earlier. 


4.3 Delay faults 


A fault that affects the transport delay associated with a signal is 
called a delay fault. Consider a fault that causes a delay to change 
from d to d’. When the signal at the site of such a fault changes, the 
signal value corresponding to the particular faulty circuit must be 
delayed by a’ instead of d. 


4.3.1 Parallel simulation 


Two aspects of delay faults must be considered: injection of delay 
faults and the propagation of the effects of delay faults. Let us assume 
that a gate which is the site of a delay fault has been evaluated at time 
t, and the jth bit of the word represents the circuit with the delay 
fault. Let the normal and faulty delays be d and d’, respectively, and 
let d < d’. Let x = (1, X2,°++, Xn) and x’ = (xj, x2,---, xn) be the 
vectors representing the old and new values, respectively. If for any 
i J, xi A xi, a vector (x1, X4,+++, Xj-1, Xj, Xj+i,+*+, Xn) will be 
scheduled to be applied to the gate output at time ¢ + d. If x; # x,;, the 
vector x’ will be scheduled to be applied to the gate output at time 
t+ d’. Similarly, if d’ < d and x} ¥ x,, the vector (x1, x2, +++, Xj-1, X}, 
Xj+1,°**, Xn) will be scheduled for time ¢ + d’. If x} A x; for any i # J, 
then the vector x’ will be scheduled to be applied at time ¢t + d. The 
vectors for updating at the different times can be obtained from the 
old and new vectors by appropriate masks and logical operations. 

From the above discussion it should be clear that the effect of delay 
faults is to cause the signal values on the same lead in the presence of 
delay faults to change at different times. When one or more values in 
a vector change, the gates to which the signal fans out in the fault-free 
and all faulty circuits are scheduled for evaluation in parallel. There- 
fore, no special treatment is necessary for propagating delay faults. 
Since any signal change at the inputs of a device, faulty or fault-free, 
will cause an evaluation of the device (faulty and fault-free), delay 
faults will tend to increase the number of evaluations required. 


4.3.2 Three-list method 


The equations required for simulating delay faults using the three- 
list and deductive methods can be derived by treating the delay fault 
as an internal fault in a functional block. These equations can then be 
used for simulating delay faults without explicitly modeling them as 
faults in functions. 
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Let a be a delay fault which causes the delay associated with a signal 
to change from d to d’. Let f, be a fault variable,’? which has the 
fault-free value of 0, but the fault a causes it to become 1. We shall 
represent the input and the output of the function used for modeling 
the delay fault by x and z, respectively. We assume that the evaluation 
is being done at time ¢ = 0, and the value of the input x, ¢, units of time 
earlier is represented by x(—t,). Two different cases must be consid- 
ered: 

Case 1: d < d’. The value of z at time d is given by 


if f, then z = x(d — d’) 
else z = x, 


which can be transformed into the equation 
2(d) = farx(d — d’) + farx. 


Case 2: d’ < d. The value of z at time d’ can be represented by a 
function as in Case 1, and transformed into the following equation: 


2(d’) = farx + farx(d’ — d). 


The equations for the three-list method can be obtained from these 
equations using the method discussed in Section 2.2. 


Case 1:d<d’ 
Z*(d) = [{a} N X*(d — d’)] VU [X'N Tah] 
2°(d) = [fa N XU [XN Xd — d’)] U [Xd - d’) N {a} 
Z(d) =[Z(d) U Z°(d)). 
Case 2: d'’<d 
Z'(d’) = [{a} N X"] U [X*(d’ — d) N Ta¥] 
Z°(d’) = [Tay N.X*] UV [XN Xd’ — d)] U [X°(a’ — d) N {a}] 
Z(d') =(Z(d‘) U 2a). 
4.3.3 Deductive simulation 


The functional equations derived in Section 4.3.2 can be used for 
deriving deductive fault lists for delay faults. The fault-list equations 
will depend on signal values as shown in Table XII. 


4.3.4 Concurrent simulation 


Since each fault is handled separately, the simulation of delay faults 
does not need any special processing. 
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Table Xll—Fault-list equations for delay faults 





Rays 2) Z(d) 

0 0 0 [X(d —d’)N {a}]U [xa a 

0 1 0 [X(d=d)N {a}]U |x (J 

1 0 1 | xn cit [X(d — d’) N {a} ] UIXNX@—a)] 

1 1 1 Xd a’) 9 (ay UIXA Xa’ ~ aJU] XO oJ| 
Case 1:d<d’ 

x x(d’—d) 2d) Zid) 

0 0 0 [XN {a}]U [x -d)n tJ 

0 1 1 [XN X(d’-d)]U[KN fay} xa’ — an fF] 

1 0 0 Bn @)Ju] Ka - ayn oJ 

1 1 1 | xa - a afe| U[XN {a}] U [XN X(d’ — d)] 
Case 2:d’<d 


4.4 Summary of results 


Changes in the fault-free and faulty values may occur at different 
times for the same line due to different rise and fall times and to delay 
faults. In the case of parallel simulation, a change in a single faulty or 
fault-free value on a line leads to computations involving the whole 
word (or pair of words). In the three-list and deductive methods, the 
addition or deletion of a single fault will require recomputation of 
complete lists. On the other hand, concurrent simulation treats each 
event, faulty or fault-free, independent of all other events and, there- 
fore, should require less computation. High-frequency rejection is also 
simpler in concurrent simulation than in the other methods. 


V. CONCLUSION 


We have compared parallel, multilist, deductive, and concurrent 
simulation methods with regard to their ability to simulate more than 
two logic values, different levels of simulation, and accurate timing 
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analysis. All the methods, except deductive, can handle any number of 
logic values without significant changes in the method. An extension 
of the deductive method to an arbitrary number of logic values is 
presented. Concurrent simulation appears to be the most convenient 
method of simulating an arbitrary number of logic values. 

All the methods, except concurrent, require the transformation of 
functional descriptions of high-level devices into Boolean equations. 
No such transformation is required for concurrent simulation. In fact, 
it is not even necessary to restrict operations in functional descriptions 
to Boolean operations if concurrent simulation is used. 

All the methods are capable of handling different rise and fall times, 
performing high-frequency rejection and simulating delay faults. Since 
concurrent simulation handles each event separately, these functions 
can be performed more easily and efficiently than the other methods. 

In addition to the aspects discussed here, two factors that must be 
considered in selecting a simulation method are storage requirements 
and speed. A detailed analysis of the speed and the storage require- 
ments of these methods is made in Ref. 11 based upon statistical data 
gathered from deductive simulation. 
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Although a sequential circuit M reduces to a combinational net- 
work Cy after all feedback paths have been cut, an application of 
Bossen and Hong’s checkpoint labeling procedure to Cm does not 
necessarily yield a minimal solution. The set of checkpoints so ob- 
tained will include all feedback lines. In this paper, it is shown that 
these feedback lines are not necessary checkpoints under a “delay 
equivalence” relation. In addition to this, we also show that not every 
fanout branch is a necessary checkpoint. Any “singular fanout 
branches” can be removed from consideration. The results of our 
analysis lead to a minimal checkpoint labeling procedure for sequen- 
tial logic networks. 


1. INTRODUCTION 


Because there are 3” — 1 possible multiple stuck faults in a logic 
network containing w distinct locations where signals may fail (each 
location may assume one of the three possible states: normal, stuck- 
at-0, or stuck-at-1), test generation and simulation procedures often 
resort to fault collapsing techniques to reduce the number of faults 
which need to be considered. To date reported results in the literature 
deal only with combinational logic. This prompted our interests in the 
research to be discussed in this paper. 

Bossen and Hong introduced a fault collapsing technique for com- 
binational logic networks called checkpoint labeling procedure.’ 
Checkpoints, as defined by them, are a number of specified points in 
the network such that any multiple fault in the network is equivalent 
to some multiple fault among these specified checkpoints.’ The check- 
points defined in Ref. 1 are a minimal set of points having the property 
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just stated. A natural question is, What are the checkpoints in a 
sequential circuit? It is well known that by cutting all the feedback 
lines, a sequential network can be mapped into a combinational 
network. A reasonable approach would be to apply Bossen and Hong’s 
labeling procedure to the resulting combinational network. The check- 
points obtained would then include all the feedback lines. In this paper, 
we show that in general all these feedback lines need not be check- 
points under a relation called ‘“delay-equivalence” to be defined later. 
We will see that the number of checkpoints can be greatly reduced for 
a highly sequential logic network if our results are utilized. This, in 
turn, greatly reduces computational complexities for multiple fault 
analysis in sequential networks. A possible application of our results 
can be found in a paper by Chang, Su, and Breuer.” 


ll, FAULT COLLAPSING IN SEQUENTIAL NETWORKS 


The checkpoints in a circuit are specially designated signal lines. 
Checkpoints are defined so that for an arbitrary stuck-at-fault a there 
exists at least one equivalent fault defined on the checkpoints. Hence, 
if there are u < w checkpoints, we need only consider 3“ — 1 multiple 
faults. Bossen and Hong have developed a labeling procedure for 
specifying the minimal set of checkpoints in a combinational circuit. 
Their results seem to be an extension of the work of Shertz and 
Poage.** 

For convenience, Bossen and Hong’s procedure is as follows: 

(t) All the primary inputs that do not fan out are checkpoints. 
(it) All the fanout branches are checkpoints. 

(tit) NOT gates are considered as lines. 

Although a sequential circuit M reduces to a combinational network 
Cw after all feedback paths have been cut, Bossen and Hong’s labeling 
procedure does not necessarily yield a minimum number of check- 
points for such circuits. 

Consider a synchronous sequential circuit M represented by Huff- 
man’s model as shown on Fig. 1, where Cy is the combinational portion 
of M, and D is the set of delay elements. We assume that faults in Cy 
are restricted to stuck-at type and faults in delay elements (D f/f’s) 
result in stuck outputs. Gates and flip-flops of M are connected by 
edges. An edge of M is either a line or a branch. Namely, it is either a 
primary input, a primary output, a fanout stem, or a fanout branch. 
Also we shall assume the following: 

Assumption 1: The output value of a gate is a function of each of its 
inputs. 

Assumption 2: The number of edges in M is finite. 

Assumption 3: There are no “inaccessible” edges in M. An edge of 
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CLOCK 


Fig. 1—Huffman’s model of a sequential circuit M. 


M is said to be inaccessible if it is not a primary input and, moreover, 
if it is not driven by any gates or memory elements of M. 

Almost all practical sequential circuits satisfy the above assump- 
tions. Figure 2 shows a fictitious integrated circuit chip. Edge 5 is 
inaccessible. 

Restricting our attention to stuck-at-faults, we define a single fault 
as exactly one edge stuck-at-1 (s-a-1), or stuck-at-0 (s-a-0), and a 
multiple fault as a collection (one or more) of single faults, each 
associated with a different edge. Also, we shall not consider intermit- 
tent faults. 

By applying Bossen and Hong’s procedure to the combinational 
network Cy, the set of checkpoints so obtained will include all feedback 
lines denoted by the vector Y. However, we shall show that this subset 
of checkpoints is not necessary under a delay-equivalence relation. 
Later we shall consider asynchronous sequential circuits. 


ill. SEQUENTIAL NETWORK WITH D FLIP-FLOPS 
Definition 1 


Two faults u and v in M are said to be delay-equivalent (d-equiva- 
lent) of order k, if M with u is equivalent to M with v after an 
application of an input sequence of length at least k. 

If the specific value of ® is not of interest to us, we shall simply say 





Fig. 2—A fictitious integrated circuit chip. 
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that u and vu are d-equivalent. To demonstrate this concept, let us refer 
to the sequential circuit shown in Fig. 3. The fault a s-a-0 is d- 
equivalent of order 1 to the multiple fault b s-a-0 and c s-a-0. 
Lemma 1: Any multiple fault in a delay flip-flop is d-equivalent of 
order 1 to a stuck input. 
Proof: Let y and Y denote, respectively, the output and input of a 
delay f/f. Then, y(q + 1) = Y(q) for all g. Thus, a stuck output is d- 
equivalent of order 1 to a stuck input. Also, a stuck input and output 
is d-equivalent of order 1 to a stuck input only, and is d-equivalent of 
order 0 to a stuck output only. 

Because of this lemma, we shall consider a delay f/f as a 1-input 
gate under the d-equivalence relation. 


Definition 2 


(i) A sequence of edges of M, denoted by [si, s2, +++, Si, +++, Sn], 
where s; ¥ s; for all 1 4 j, is said to be a forward path, if for each 
i <n either (a) s; is a fanout stem and s;+: is a fanout branch of s;, or 
(b) there exists a gate of M, say g, such that s; and s;+1 are, respectively, 
an input and the output of g. 

If s; = a and s, = £, the path [si, so, --+ , Sn] is said to be a forward 
path from « to B. If B is a primary output, the path is said to be a 
terminal forward path of a. 

(it) A sequence of edges of M, denoted by [s1, sz, «++, Si, «++, Sn], 
where s; ¥ s; for all i ¥ j, is said to be a backward path, if for each 
i <n either (a) s; is a fanout branch and its stem is si4:, or (b) there 
exists a gate of M, say h, such that s; and s;4; are, respectively, the 
output and an input of h. 

If s; = a and s, = £, the path is said to be a backward path from a 
to 8. If B is a primary input, then the sequence [s1, S2, +++ , Si, «++, Sn] 
is said to be a terminal backward path of a. 


COMBINATIONAL CIRCUIT 





EXCLUSIVE 
OR GATE 





CLOCK 


Fig. 3—A simple sequential circuit. 
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A backward path from a to B, denoted by p, is said to be a backward 
path of a, if there exists no edge I in M such that a backward path 
from a to I contains every element of p. The path p is said to terminate 
at B. 

Clearly, if 8 is a primary input of M, then a backward path from a 
to B is also a terminal backward path of a. 

The path [s1, so, --+ , Sn] is said to pass through each s;. Conversely, 
each s; is said to be contained in the path. Branch s; is also said to be 
an element of the path [si, s2, +++ , Sn]. 

As an example, consider M; on Fig. 4. The sequences [12, 18, 6, 8, 9, 
11, 20] and [12, 17, 19, 5, 8, 9, 11, 20] are forward paths of edge 12. 
However, the sequence [12, 18, 6, 8, 10, 12, 15, 16, 19, 5, 8, 9, 11, 20] is 
not a forward path of 12 because edges 12 and 8 appear twice in the 
sequence. All backward paths of edge 20 are shown in the graph of Fig. 
5. Note that this graph is a tree with edge 20 as the root, and the 
leaves are either primary inputs or fanout branches. 


Definition 3 


If every edge of M possesses at least one terminal forward path, then 
M is said to be a regular sequential circuit. Otherwise, M is said to be 
irregular. 

We will focus our attention on regular sequential circuits. 


Definition 4 


A sensitized path from an edge a = x; to an output 6 in Misa 
terminal forward path from a to £, along with constant signal values 
assigned to some of the other edges such that changing the logic value 
of x; will change the logic value of £. 





Fig. 4—Sequential circuit MM. 
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Fig. 5—Backward paths of edge 20 of M;. Underline indicates either a primary input 
or nonsingular fanout branch. Starred edges are members of K(20). 


In the circuit shown in Fig. 4, we can sensitize the edge 1 to the 
output 20 by setting edge 9 to s-a-1 and edge 13 to s-a-0. 
Lemma 2: Let g be a gate in M, a regular sequential circuit, and x, 
X2, +++ , Xn be the inputs to g. For each i, there always exists a multiple 
fault in M which will sensitize x; to an output. 
Proof: Since M is regular there exists at least one terminal forward 
path from x; to an output £. Using the concept of Boolean difference,” 
the condition for sensitizing the output of g to the input «x; is given by 
the equation 


dg 
—_—_ = @B = o,+ 9g, = 1, 
dx: 80 WV £1 = 8081 t+ 8081 
where 
80 = B(x, eee , Xi-1, 0, Xi+l, °° y Xn), 
and 
eL= B(x, cee, Xi-1, l, Xi+l, °° % 5 Xn). 
Now select any minterm of go or £0g1. If this minterm specifies x; = 
o;, where g; is either 0 or 1, then let the x; input to g be stuck at o;. Now 
assume that for the terminal forward path from x; to £, the output of 
g is an input to gate h. Repeating the process just described, h can be 
sensitized to g. This chaining process is continued until £ is reached. 
Definition 5 


Let a and £ be two edges of M. Edge a is said to dominate edge B, 
if both a s-a-0 and a s-a-1 will cause M to become independent of the 
signal on £ regardless of whether or not other edges are normal. 
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To illustrate this, let us consider Fig. 2. Edge 6 dominates edges 1 
and 3. Edge 7 dominates all other edges. However, edge 2 does not 
dominate edge 1, because when edge 3 s-a-1 and edge 4 s-a-0, edge 7 is 
a function of edge 1 independent of the state of edge 2. 

Theorem 1: Edge a dominates edge B, if and only if all terminal 
forward paths of B contain a. 

Proof: Suppose all forward paths of 8 pass through a. A stuck a will 
block any signal on 8. Thus, M becomes independent of £ if a is stuck. 
Therefore, a dominates 8. Suppose there exists one terminal forward 
path of 8 that does not contain a. Then, by Lemma 2, one can always 
find a multiple fault on M, which will sensitize the signal on £ to at 
least one of the primary outputs of M. Thus, M is dependent on 8 and 
a does not dominate £. This proves the only if part. 

Theorem 2: Dominance relation induces a partial ordering. 

Proof: Every forward path of a passes through a. Thus, a dominates 
a. Let a, dominate a2 and a, dominate a3. By Theorem 1, all forward 
paths of a3 and a2 pass through a2 and a, respectively. Thus, all 
forward paths of a3 also pass through a. This implies that a also 
dominates a3. Suppose a dominates £ and 8 dominates a. By Theorem 
1, all forward paths of 8 pass through a and all forward paths of a pass 
through £. This means that if a # £, none of a and B has a forward 
path. Therefore, a dominates 8 and 8B dominates a implies a = £. 
Namely, dominance relation is reflexive, transitive, and antisymmetric. 
We conclude that it induces a partial ordering. 

In a combinational network, a fanout branch never dominates other 
fanout branches of its stem. However, in a sequential circuit a fanout 
branch may dominate other fanout branches of its stem. Consider, for 
example, the sequential circuit M,, which is shown on Fig. 4. Branches 
9 and 10 are fanout branches of stem 8. Since every forward path of 10 
passes through 9, branch 9 dominates branch 10. 


Definition 6 


A fanout branch that dominates other fanout branches of its stem 
is said to be singular; otherwise, it is said to be nonsingular. 
Theorem 3: (i) A stem possesses at most one singular branch. 
(ti) Every singular branch dominates all other fanout branches of 
its stem. 

Proof: Let a stem which possesses singular branches be denoted by B 
and its fanout branches be denoted by ai, a2, +++, Qn, where a is 
singular. By definition, a, must dominate at least one a; (~a). If a;, 
where j ¥ 1 or i, is also singular, then a; will have at least one forward 
path that does not pass through a; because every forward path of a; 
contains the fanout stem £. Thus, a; will no longer be dominated by 
a,. This contradiction proves (1). Suppose some a; is not dominated by 
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a. Then, a, possesses at least one forward path that does not pass 
through a;. It follows that a;, for all i ¥ 1, also possesses at least one 
forward path that does not pass through ay. Part (ii) of the theorem 
follows as a result of Theorem 1. 


Definition 7 


The kernel set of edge a of M, denoted by K(a), is a set of edges of 
M such that 
(z) every backward path of a contains exactly one member of K(a), 
(it) members of K(a) are either primary inputs that do not fanout 
or nonsingular fanout branches, and 

(iit) every forward path from each member of K(a) to a does not 
contain any other nonsingular fanout branches. 

Let us consider M, again (see Fig. 4). All backward paths of edge 20 
are shown on Fig. 5. Each underlined edge is either a primary input or 
nonsingular fanout branch. Every starred edge is an element of 
the kernel set of edge 20 [i.e., K(20)]. From Fig. 5 we have K(20) = 
{1, 4, 13, 14, 15, 17, 18}. The following kernel sets can be easily verified: 
K(1) = {1}; K(12) = {7, 10}; and K(8) = {4, 14, 15, 17, 18}. 

Lemma 3: Let «a and B be any pair of edges of M. 
(t) A stuck a is d-equivalent to some multiple fault among its 
kernel set K(a). 

(it) If K(a) and K(f) are disjoint, then any multiple fault among a 
and B is d-equivalent to some multiple fault among K(a) U K(f). 
Proof: Part (i) follows jointly from the fact that any multiple faults of 
a logic gate are equivalent to multiple faults among the input lines 
only,’ and the definition of K(a) which requires that every backward 
path of a contains exactly one member of K(a). Part (iz) follows part 
(t) of the lemma. 

Lemma 4: If a does not dominate B, then either (i) there exists no 
forward path from B to a, or (ii) every forward path from B to a 
contains at least one nonsingular branch. 

Proof: If statement (z) is true, then a does not dominate £. 

Suppose there exists a forward path from £ to a, denoted by p = 
[si, «++, S,], that does not contain any nonsingular branch. Then sg; is 
either a fanout free stem or a singular branch. Let s; be singular. Then, 
si-1 1s a fanout stem. By Theorem 3, all other fanout branches of s;-1 
are dominated by s;. Therefore, all forward paths of s;_1 pass through 
s;. Thus, all forward paths of 8 also pass through a. This implies that 
a dominates 8. Suppose every s; is a stem without fanout, then p is a 
fanout free path. But by Theorem 1, this also implies that a dominate 
B. The contradiction proves the lemma. 

Theorem 4: If a does not dominate B and B does not dominate a, then 
their kernel sets are disjoint. 
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Proof: Let o « K(f). Then, o falls into exactly one of the following 
categories. 

(t) o lies in a backward path of 8 which does not contain any edge 
in common with any backward path of a. 

(it) o is contained in a forward path from a to B. 

(itt) o is contained in a backward path of B, say p, that does not 
contain a but p, and a backward path of a, say g, contains common 
branches. 

Clearly, if o belongs to category (1), o € K(a). If o belongs to category 
(it), by Lemma 4 and category (iii) of the definition of K(a), o€ K(a). 
So, consider only category (vii). Let p = [si, se, «++, Sn], and g = 
[t;, fa, --+, tm]. Category (zz) implies that there exists u and v such 
that s, = t, and s; ¥ ¢ for alli <u, 7 < v. Namely, s, (or ¢,) is a fanout 
stem. If s,-; and t,-; are nonsingular, by definition oa lies in a forward 
path from s,-; to 8; that is, o € K(a). If sy-1 is singular, by Theorem 3 
ty-1 is not. Thus, branch ¢,_; is dominated by s,-; and so is a. If s,-; is 
dominated by B then so is a. This contradicts the assumption. There- 
fore, s,-1 is not dominated by 8. By Lemma 4, there exists k = u — 1 
such that s; is a nonsingular branch. This implies that o € K(a). 
Therefore, K(8) and K(a) are disjoint. 

Theorem 5: In a regular sequential circuit, every singular branch is 
not a necessary checkpoint, but all nonsingular fanout branches are 
necessary checkpoints. 

Proof: Let B be a fanout stem with fanout branches ay, +--+ , an, where 
a, is singular. By Theorem 3, every a; ¥ ai, is dominated by a. Thus, 
a stuck a; edge is d-equivalent to a stuck B edge. As to a stuck a; edge, 
for any 1 ¥ 1, it is not d-equivalent to either a stuck B edge, or stuck 
a, edge, or stuck a; edge for 7 ¥ 1. Therefore, all nonsingular fanout 
branches of £ are necessary checkpoints. 

Theorem 6: For any irregular sequential circuit M there exists a 
regular sequential circuit M* such that M and M* are equivalent, 
and furthermore, M* preserves fault behavior of M under stuck-type 
fault assumption. 

Proof: Being irregular, M possesses two sets of edges, denoted by Q 
and {*, where each member 2 does not have any terminal forward 
paths, and each member of {* possesses at least one terminal forward 
path. It follows that no primary output of M is a function of the signals 
on any member of 22 under fault-free or any stuck-type fault conditions. 
Therefore, removing all members of Q will not alter the normal or 
abnormal functional behavior of M. The resulting circuit consists of 
only edges in {2* and is regular. 

To illustrate this theorem, consider sequential circuit M2 on Fig. 6a. 
One can easily verify that 2 = {4, 5, 8, 11, 13, 14, 16} and Q* = 
{1, 2, 3, 6, 7, 8, 9, 10, 12, 15, 17}. After removing 2 from M2 and 
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(b) 


Fig. 6—Use of Theorem 6. (a) Sequential circuit. (b) Circuit M3. 


relabeling, one obtains the sequential circuit M3, shown on Fig. 6b, 
which is actually a combinational circuit. 
Theorem 7: In a sequential circuit M, either (i) the kernel set for any 
edge in M exists, or (ii) M is irregular. 
Proof: Suppose a is an edge of M. Let a backward path of a be denoted 
by p = [si, Se, «++, S,], where s; = a. Then, p belongs to one of the 
following categories. 
(i) p is a terminal backward path; namely, s, is a primary input of 
(ti) There exists 7 <n such that s, and s,_; are fanout branches of 
t. (See Fig. 7a.) 

(iii) There exists a gate g such that a is an input of g and s, is the 

output of g. (See Fig. 7b.) 
(iv) s, is inaccessible. 

Category (iv) never occurs because of Assumption 3. 

Suppose p falls into either category (z) or (ti). Then every backward 
path of a contains at least one nonsingular fanout branch or a primary 
input. This implies that K(a) does exist if case (iii) does not apply. 

Now consider case (ii). Suppose there exists no s;, 1 =i = 7, that is, 
a fanout branch. This means that every element of p does not have a 
terminal forward path. Thus, / is irregular. 

Suppose there exists some s;, 1 = 1 <n, that is a fanout branch. If s; 
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is singular, then s;+1 is the stem of s;. Moreover, s;+1 is dominated by sj. 
If all other fanout branches of p are also singular, then by the 
transitivity of dominance, s; would be dominated by sj4:. This would 
imply that neither s; nor s;+; has a forward path. Thus, we conclude 
that some s; must be a nonsingular fanout branch and, hence, the 
theorem follows. 

From the foregoing analysis, we now derive a checkpoint labeling 
procedure for a sequential circuit M. 

Step 1. If M is irregular, convert M into M™* as in the proof of 
Theorem 6. 

Step 2. All primary inputs that do not fanout are checkpoints. 

Step 3. All nonsingular fanout branches are checkpoints. 


$4 





54 Sn 


(b) 


Fig. 7—Proof of Theorem 7. (a) Category (ii); (b) Category (vii). 
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Step 4. NOT gates are considered as lines and delay f/f’s are con- 
sidered as 1-input gates. 

Theorem 8: The above procedure yields the necessary and sufficient 
checkpoints to represent all the multiple faults in M which are 
detectable. 

Proof: Clearly, faults among the edges of {2 of Theorem 6 are unde- 
tectable and can be ignored. Let « be a multiple fault defined over a set 
of edges of M*, denoted by A = {a;}. Let B = {8;} be constructed 
from A by removing all elements of A which are dominated by some 
other elements of A. Then B is the largest subset of A such that no 
member of B is dominated by any member of B. Because any signal 
on a dominated edge will be blocked by the edges which dominate it, 
€ is equivalent to a multiple fault among B. Furthermore, since £; € B 
does not dominate £; € B for alli ~7, K(8,;) and K(B;) are disjoint. By 
Lemma 3, a multiple fault among B is equivalent to a multiple fault 
among the following collection of edges of M*:T = User K(fi). Since 
I is a subset of the collection of checkpoints W given by the above 
procedure, ¢€ is equivalent to a multiple fault among W. This proves 
the sufficiency. The necessity follows from Theorems 5 and 6. 

In comparison with the set of checkpoints obtained by applying 
Bossen and Hong’s procedure to the combinational network Cy of Fig. 
1, we see that we have removed from consideration the following edges: 
(i) all feedback lines represented by the vector Y; and (ii) all singular 
fanout branches. 

Therefore, our work greatly simplifies fault analysis of cyclic logic 
networks. 

It is important to point out that Definition 6 and Theorem 1 can be 
used to identify the singular edges. 


lV. ASYNCHRONOUS SEQUENTIAL CIRCUITS 


Having found a minimal checkpoint labeling procedure for synchro- 
nous sequential circuits with delay flip-flops as memory elements, we 
now consider asynchronous sequential circuit. The structure of an 
asynchronous sequential circuit can be represented as shown in Fig. 8. 





LUMPED 
PROPAGATION ————-—— 
DELAYS 


Fig. 8—Asynchronous sequential circuit. 
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It can be seen that the model is almost identical to the one shown in 
Fig. 1. The only difference is that the clocked delay flip-flops of Fig. 1 
are now lumped propagation delays of the feedback lines. Because of 
this, one might conjecture that the checkpoint labeling procedure is 
applicable without modification to asynchronous sequential circuits. 
This conjecture is indeed true. 

Consider Step 3 of the labeling procedure. It states that a delay flip- 
flop is considered as a 1-input gate. Under stuck-type fault assumption, 
this is equivalent to saying that a delay flip-flop is considered as an 
edge in the procedure. Therefore, we ascertain that the foregoing 
analysis carries over to an asynchronous case under the permanent 
stuck-at fault assumption. 


V. SUMMARY AND DISCUSSION 


In our analysis, we have developed a minimal checkpoint labeling 
procedure for squential circuits. This procedure is applicable to both 
synchronous sequential circuits with delay flip-flops as memory ele- 
ments and asynchronous sequential circuits. Since most of the clocked 
flip-flops in use today, such as JK f/f, latch f/f, etc., are actually made 
of asynchronous sequential circuits, the procedure is applicable to any 
synchronous sequential circuits whose memory elements are either 
delay flip-flops or the aforementioned clocked flip-flops. 


REFERENCES 


1. D. C. Bossen and S. J. Hong, “Cause-Effect Analysis for Multiple Fault Detection 
in Combination Networks,” IEEE Trans. Computers (November 1971), pp. 1252- 
7 


2. S. J. Chang, S. Y. H. Su, and M. A. Breuer, “Detection and Location of Multiple 
Stuck-Type Failures in Synchronous Sequential Circuits,” IEEE Computer Soc. 
Repository, Report No. R-72-233. 

3. D. R. Shertz, “On the Representation of Digital Faults,” Report R-418, Coordinated 
Science Laboratory, University of Illinois, Urbana, Illinois. 

4. J. F. Poage, “Derivation of Optimum Tests to Detect Faults in Combinatorial 
Circuits,” Mathematical Theory of Automata, Polytechnic Institute of Brooklyn, 
pp. 483-528, 1963. 

5. M. A. Breuer and A. D. Friedman, “Diagnosis and Reliable Design of Digital 
Systems,” Woodland Hills, California: Computer Science Press, 1976. 


FAULT-COLLAPSING ANALYSIS 2271 


CONTRIBUTORS TO THIS ISSUE 


Vaclav E. Benes, A.B., 1950, Harvard College; M.A. and Ph.D., 
1953, Princeton University; Bell Laboratories, 1953—. Mr. Benes has 
pursued mathematical research in traffic theory, stochastic processes, 
frequency modulation, combinatorics, servomechanisms, stochastic 
control, and filtering. In 1959-60 he was visiting lecturer in mathemat- 
ics at Dartmouth College. In 1971 he taught stochastic processes at 
SUNY Buffalo, and in 1971-72 he was Visiting MacKay Lecturer in 
electrical engineering at the University of California in Berkeley. He 
is the author of General Stochastic Processes in the Theory of Queues 
(Addison-Wesley, 1963), and of Mathematical Theory of Connecting 
Networks and Telephone Traffic (Academic Press, 1965). Member, 
American Mathematical Society, Association for Symbolic Logic, In- 
stitute of Mathematical Statistics, SIAM, Mathematical Association 
of America, IEEE, Phi Beta Kappa. 


Melvin A. Breuer, B.S. (Engineering), 1959, M.S., 1961, University 
of California, Los Angeles; Ph.D. (Electrical Engineering), 1965, Uni- 
versity of California, Berkeley; University of Southern California, Los 
Angeles, 1965—. At the University of Southern California, Los Angeles, 
Mr. Breuer is Professor of Electrical Engineering. His main interests 
are in the area of switching theory, computer-aided design of com- 
puters, fault diagnosis, and simulation. In addition to his research 
activities, Mr. Breuer has been at the forefront of developing courses 
on design automation and fault tolerant computing at universities and 
at a number of research institutes. He is the editor and co-author of 
Design Automation of Digital Systems: Theory and Techniques, Pren- 
tice-Hall; editor of Digital System Design Automation: Languages, 
Simulation and Data Base, Computer Science Press; co-author of 
Diagnosis and Reliable Design of Digital Systems, Computer Science 
Press; and editor-in-chief of the Journal of Design Automation and 
Fault Tolerant Computing. Member, IEEE, Sigma Xi, Tau Beta Pi, 
Eta Kappa Nu. 


Shih-Jeh Chang, Diploma E.E., 1959, Taipei Institute of Technol- 
ogy, M.S.E.E., 1966, Ph.D., 1972, University of Southern California; 
Bell Laboratories, 1972—. Early in his career at Bell Laboratories, Mr. 
Chang worked on the No. 1 Automatic Message Accounting and 
Recording Center (AMARC) in the area of system architecture, fault 
recognition and recovery. He was instrumental in the early phase 
design of 3B cache and memory management system. He also worked 


2273 


in the area of a UNIX* base real-time computer system performance 
analysis and improvement and as a UNIX system consultant. Cur- 
rently he is engaged in No. 5 ESS maintenance software design and 
development and multimodule office software development coordina- 
tion and integration. Member, IEEE, Eta Kappa Nu. 


Aland K. Chin, B.A., 1972, Brandeis University; M.S., 1975, Ph.D., 
1977, Cornell University; Senior Research Engineer, Honeywell Elec- 
tro-Optics Center, 1977-1978; Bell Laboratories, 1978—. Mr. Chin is 
involved in the design, processing, and characterization of light-emit- 
ting diodes for optical communication systems. Member, American 
Physical Society, American Association for the Advancement of Sci- 
ence, Phi Beta Kappa, New York Academy of Science, Electrochemical 
Society. 


Kai Y. Eng, B.S.E.E. (summa cum laude), 1974, Newark College of 
Engineering; M.S. (Electrical Engineering), 1976, Dr. Engr. Sc. (Elec- 
trical Engineering), 1979, Columbia University; RCA Astro-Electron- 
ics, 1974-1979; Bell Laboratories, 1979—. Mr. Eng has worked on 
various areas of microwave transmission, spacecraft antenna analysis, 
and communications satellites. He is presently a member of the Radio 
Research Laboratory, studying TV transmission through satellites. 
Member, IEEE, Sigma Xi, Tau Beta Pi, Eta Kappa Nu, Phi Eta 
Sigma. 


James L. Flanagan, B.S. (Electrical Engineering) 1948, Mississippi 
State University; M.S., 1950, Sc.D., 1955, Massachusetts Institute of 
Technology. Bell Laboratories 1957—. Mr. Flanagan is Head, Acous- 
tics Research Department. He has project responsibilities for digital 
voice encoding, speech recognition and synthesis, electroacoustic sys- 
tems and transducers. Fellow, IEEE, Acoustical Society of America. 
Member, National Academy of Engineering. 


Ralph E. Frazee, Jr., (Electronic Engineering); Western Electric, 
1969—. At the Western Electric Engineering Research Center, 
Princeton, N.J., Mr. Frazee has been primarily engaged in research on 
manufacturing process control and inspection systems. His current 
interests include measurement, characterization, and control of optical 
fiber manufacturing processes. 


* Registered trademark of Bell Laboratories. 


2274 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


Allen Gersho, B.S.(E.E.), 1960, Massachusetts Institute of Tech- 
nology; M.S., 1961, and Ph.D., 1963, Cornell University; Bell Labora- 
tories, 1963-1980; University of California, Santa Barbara, 1980—. At 
Bell Laboratories, Mr. Gersho was a member of the technical staff at 
the Mathematics and Statistics Research Center where he was engaged 
in basic and applied research in digital communications, data trans- 
mission, quantization, and signal processing. At the University of 
California, he holds a professorship in Electrical and Computer Engi- 
neering. He has served as Editor of the IEEE Communications Mag- 
azine and is currently an Associate Editor of Communication Theory, 
IEEE Transactions on Communications. Member, Board of Governors 
of the IEEE Communications Society. 


Ytzhak Levendel, B.S.E.E., 1971, Technion-Israel; M.S.C.S., 1974, 
The Weitzman Institute of Science; Ph.D., 1976, University of South- 
ern California; Bell Laboratories 1976—. Mr. Levendel has done re- 
search in fault diagnosis and is currently involved in the development 
of a logic and test design aid system. Member, IEEE, Eta Kappa Nu. 


Tong L. Lim, A.B., B.E., 1973, Dartmouth College; M.S.E.E., 1974, 
Ph.D. (E.E.), 1976, University of California, Berkeley; University of 
Pennsylvania, 1976-1978; Bell Laboratories, 1978—. Mr. Lim has 
worked on problems related to radiophysics and radar and sonar signal 
processing. His current interests are in data transmission and graphics 
communication. Member, IEEE, Sigma Xi; Associate Editor, IEEE 
Communications Magazine. 


Subhash Mahajan, B.S. (Physics, Chemistry, and Mathematics), 
1959, Panjab University, B.E. (Metallurgy), 1961, Indian Institute of 
Science, Bangalore, and Ph.D. (Materials Science), 1965, University of 
California, Berkeley; University of Denver, 1965-1968; United King- 
dom Atomic Energy Authority Research Fellow, Atomic Energy Re- 
search Establishment, Harwell, England, 1968-1971; Bell Laboratories, 
1971—. Mr. Mahajan has done research in the nucleation and growth 
of deformation twins in metallic crystals, mechanisms of phase trans- 
formations and plastic deformation, and in the role of deformation 
twins in the nucleation of fraction. The central theme of his research 
has been to establish the correlations between structure and properties 
in materials. Most recently he has been involved in understanding the 
interrelationships between substructure and growth parameters in ITI- 
V compound semiconductors. He is presently a group supervisor in the 
Materials Research Laboratory. Member, AIME, Electrochemical So- 
ciety, Sigma Xi, New York Academy of Sciences. 


CONTRIBUTORS TO THIS ISSUE 2275 


David Malah, B.S. (E.E.), 1964, M.S. (E.E.), 1967, Technion-Israel 
Institute of Technology, Ph.D. (E.E.), 1971, University of Minnesota; 
University of New Brunswick, Canada N.B., 1970-72; Technion-Israel 
Institute of Technology, 1972—; sabbatical leave, Bell Laboratories, 
1979-81. At the University of Minnesota, Mr. Malah was Assistant 
Professor in the Electrical Engineering Department. At Technion- 
Israel, he is Associate Professor of Electrical Engineering. During 
1975-1979, Mr. Malah was in charge of the Signal Processing Labora- 
tory at Technion-Israel and was involved in research on speech and 
image communications, and real-time digital hardware implementa- 
tions. Additionally, he has done research on discrete time systems, 
digital signal processing techniques, speech communications, and adap- 
tive filtering. While on leave he is conducting research on narrow-band 
communications and digital signal processing techniques in the Acous- 
tics Research Department at Bell Laboratories. 


Premachandran R. Menon, B.S. (Electrical Engineering), 1954, 
Banaras Hindu University; Ph.D. (Electrical Engineering), 1962, Uni- 
versity of Washington; Bell Laboratories, 1963—. Mr. Menon has done 
research in switching theory and fault diagnosis and is currently 
involved in the development of a logic simulation system. Member, 
IEEE. 


Markus S. Mueller, E.E. Diploma, 1970, Ph.D. (Electronic Engi- 
neering), 1976, Swiss Federal Institute of Technology, Zurich, Switz- 
erland; Swiss Federal Institute of Technology, Zurich, 1971-1976; 
GenRad, Zurich, 1976-1978; Bell Laboratories, 1978-1981. Mr. Mueller 
was teaching and research assistant at the Swiss Federal Institute of 
Technology, where he worked in various fields, including filter theory, 
data transmission, and adaptive signal processing. He was a product 
specialist with GenRad for computer controlled automatic test sys- 
tems. At Bell Laboratories, he was a member of the Data Theory 
Group in the Data Systems and Technology Department and his 
interest was in data communication, digital signal processing, and 
adaptive systems. 


Jack Salz, B.S.E.E., 1955, M.S.E., 1956, and Ph.D., 1961, University 
of Florida; Bell Laboratories, 1961—. Mr. Salz first worked on remote 
line concentrators for the electronic switching system. Since 1968, he 
has supervised a group engaged in theoretical studies in data commu- 
nications, and he is currently a member of the Communications 
Methods Research Department. During the academic year 1967-68, 
he was on leave as Professor of Electrical Engineering at the University 


2276 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


of Florida. In Spring, 1981, he was a visiting lecturer at Stanford 
University. Member, Sigma Xi. 


Dhiraj K. Sharma, B. Tech. (Electrical Engineering), 1971, LI.T. 
Kanpur, India; M.S. and Ph.D. (Electrical Engineering), 1972 and 1975, 
California Institute of Technology; Bell Laboratories 1975—. Mr. 
Sharma has worked on efficient encoding of video signals and identi- 
fication of structural parameters of tall buildings. His current interests 
are in screen-based human interfaces, communication and synchroni- 
zation in distributed systems, and programming languages. Member, 
ACM, Sigma Xi. 


David H. Smithgall, B.S. (Electrical Engineering), 1967, M.S., 
1968, Ph.D., 1970, Cornell University; Western Electric, 1970—. At the 
Western Electric Engineering Research Center, Princeton, N.J., Mr. 
Smithgall has been engaged primarily in research in process charac- 
terization and control. Since 1973 he has been involved in lightguide 
fiber measurement and characterization of the fiber and preform 
fabrication processes. Member, IEEE. 


Raymond Steele, B.S., 1959, Ph.D. (Electrical Engineering) 1975, 
Durham University, Durham, England; E. K. Cole, Ltd., 1959-1961; 
Cossor Radar, Electronics, Ltd., 1961-1962; The Marconi Company, 
1962-1965; Royal Naval College, 1965-1968; Loughborough University 
of Technology, 1968-1979; Bell Laboratories, 1979—. At E. K. Cole, 
Ltd., Cossor Radar, Electronics, Ltd., and The Marconi Company, all 
located in Essex, England, Mr. Steele was engaged in research and 
development. As a member of the Lecturing Staff at the Royal Naval 
College in London he lectured on telecommunications for the NATO 
and external London University degree courses. At the Loughborough 
University of Technology in Loughborough, England, he directed a 
research group in digital encoding of speech and television signals, in 
addition to serving as Senior Lecturer. Before joining Bell Laboratories 
on a full time basis, Mr. Steele served as a part-time consultant to the 
Laboratories’ Acoustics Research Department. Presently, he is a mem- 
ber of the Communications Methods Research Department. He is the 
author of Delta Modulation Systems published in 1975. 


Henryk Temkin, M.S. 1971, Yeshiva University, Ph.D. (Physics), 
1975, Stevens Institute of Technology; Cornell University, 1975-1977; 
Bell Laboratories, 1977—; Mr. Temkin is working on developing light- 
emitting diodes for optical communications. Member, American Phys- 
ical Society, Electrochemical Society. 


CONTRIBUTORS TO THIS ISSUE 2277 


Diane Vitello, A.A. (Mathematics) Brookdale Community College; 
Bell Laboratories, 1962—. Ms. Vitello is with the Communications 
Methods Research Department. She has written computer programs 
for radio research projects in antenna design, satellite tracking, and 
fiber optic transmission. She has also written several user-oriented 
programs to aid in the design and analysis of terrestrial and satellite 
digital radio systems. Ms. Vitello is currently continuing undergraduate 
work in Mathematics at Monmouth College. 


On-Ching Yue, B.E.E., 1968, Cooper Union; M.S.E.E., 1971, Roch- 
ester Institute of Technology; Ph.D. (Information Sciences), 1977, 
University of California at San Diego; General Dynamics/Electronics 
Division, 1968-1977; Bell Laboratories, 1977—. Mr. Yue has worked in 
the areas of underwater acoustics and microwave imaging. Since 
joining Bell Laboratories, he has been a member of the Radio Research 
Laboratory, studying the effect of interference on digital communica- 
tions systems, including intersymbol, adjacent satellite, and multiuser 
interferences. Member, Tau Beta Pi, Eta Kappa Nu, Phi Kappa Phi; 
Senior Member, IEEE. 


2278 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


PAPERS BY BELL LABORATORIES AUTHORS 


COMPUTING /MATHEMATICS 


On Ternary Self-Dual Codes of Length 24. J. Leon, V. Pless, and N. J. A. Sloane, 
IEEE Trans Information Theory, [T-27 (March 1981), pp 176-80. 


ENGINEERING 


Broadband Astigmatic Compensation. T.S. Chu, IEEE/AP-S Int Symp Digest 
(June 15, 1981), pp 131-4. 

Digital Satellite Communications-Problems and Possibilities. V. I. Johannes, 
Conf Proc, Commun Techniques Seminar (March 24, 1981), pp 6-1-6.4. 

Plots and Tests for Goodness of Fit With Randomly Censored Data. V.N. Nair, 
Biometrika, 68, No. 1 (April 1981), pp 99-103. 

Process Simulation in Two Dimensions. B. R. Penumalli, 1981 IEEE Int Solid- 
State Circuits Conf Digest of Technical Papers, XXIV (February 1981), pp 212-3. 


Per-Channel, Memory Oriented, Transmultiplexer With Logarithmic Process- 
ing—Architecture and Simulation. C.F. Kurth, K. J. Bures, and P. R. Gagnon, Int 
Commun Conf, 1981, Conf Record, J, No. 7.3 (June 15, 1981), pp 7.3.1-5. 
Per-Channel, Memory Oriented, Transmultiplexer With Logarithmic Process- 
ing—Lab Model Implementation and Testing. C.F. Kurth, P. R. Gagnon, and K. 
J. Bures, Int Commun Conf, 1981 Conf Record, J, No. 7.4 (June 15, 1981), pp 7.4.1-4. 
‘Ultraviolet Radiation Curable Conformal Coatings for Printed Circuit Board 
Assemblies. G. B. Fefferman and T. S. Hsu, Proc Printed Circuit World Convention 
II, 7 (June 1981), pp 303-15. 


PHYSICAL SCIENCES 


Boundary-Layer Model of Field Effects in a Bistable Liquid-Crystal Geometry. 
J. Cheng, R. N. Thurston, and D. W. Berreman, J Appl Phys, 52 (April 1981), pp 2756- 
65. 

Electronic Charge Density of V3Si. L. F. Mattheiss and D. R. Hamann, Solid State 
Commun, 38, No. 8 (May 1981), pp 689-94. 

Electronic Stacking-Fault States in Silicon. L. F. Mattheiss and J. R. Patel, Phys 
Rev, 23, No. 10 (May 15, 1981), pp 5384-96. 

Equilibrium and Stability of Liquid Crystal Configurations in an Electric Field. 
R. N. Thurston and D. W. Berreman, J Appl Phys, 52 (January 1981), pp 508-9. 
Erythrocyte Protoporphyrin/Heme Ratio by Hematofluorometry. A. A. La- 
mola, J. Hisinger, and W. E. Blumberg, Clinical Chem, 26 (1980), p 677. 

Generation of High-Brightness Coherent Radiation in the Vacuum Ultraviolet 
by Four-Wave Parametric Oscillation in Mercury Vapor. J. Bokor, R. R. Free- 
man, R. L. Panick, and J. C. White, Opt Lett, 6, No. 4 (April 1981), pp 182-4. 
Interplay of Structure and Magnetic Properties in MnSi: A Concentrated Spin 
Glass. TT. M. Hayes, J. W. Allen, J. B. Joyce, and J. J. Hauser, Phys Rev, 23, No. 9 
(May 1, 1981), pp 4691-7. 

Light Scattering by Photoexcited Two-Dimensional Electron Plasma in GaAs- 
(A1Ga) as Heterostructures. A. Pinczuk, J. Shah, A. C. Gossard, and W. Wiegmann, 
Phys Rev Lett, 46, No. 20 (May 18, 1981), pp 1341-4. 


Liquid Crystals and Geodesics. R. N. Thurston and F. J. Almgren, J De Phys 
(Paris), 42 (March 1981), pp 413-7. 

Orientation and Implantation Effects on Stacking Faults During Silicon Buried 
Layer P Processing. M. Robinson, G. A. Rozgonyi, T. E. Seidel, and M. H. Read, J 
Electrochem Soc, 128 (April 1981), pp 926-9. 

The Propagation of Disclinations in Bistable Switching. J. Cheng and R. N. 
Thurston, J Appl Phys, 52 (April 1981), pp 2766-75. 


2279 


Sensitivity Analysis of Oscillating Reactions. 1. The Period of the Oregonator. 
D. Edelson and V. M. Thomas, J Phys Chem, 85 (May 1981), pp 1555-8. 


Stability of Nematic Liquid Crystal Configurations. R.N. Thurston, J De Phys 
(Paris), 42 (March 1981), pp 419-25. 


Unit Cell of the y-Phase of Poly(Vinylidene Fluoride). A. J. Lovinger, Macro- 
molecules, 14, No. 2 (March 1981), pp 322-5. 


Unit Sphere Description of Liquid Crystal Configurations. KR. N. Thurston, J 
Appl Phys, 52 (April 1981), pp 3040-52. 


2280 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1981 


CONTENTS, DECEMBER 1981 


Source Coding for Multiple Descriptions. II: A Binary Source 
H. S. Witsenhausen and A. D. Wyner 


Microprocessor Firmware Update Inventory Model 
S. M. Brecher 


On the Performance of Phase-Shift-Keying Systems 
V. K. Prabhu and J. Salz 


On the Rapid Initial Convergence of Least-Squares Equalizer Adjust- 
ment Algorithms 
M. S. Mueller 


Criteria for the Response of Nonlinear Systems to be L-Asymptotically 
Periodic 
I. W. Sandberg 


Study of a Time-Compression Technique for TV Transmission Using 
a Chirp Filter and Envelope Detection 
K. Y. Eng and B. G. Haskell 


Conformal Mapping and Complex Coordinates in Cassegrainian and 
Gregorian Reflector Antennas , 
C. Dragone 


2281 








THE BELL SYSTEM TECHNICAL JOURNAL is abstracted or indexed by Abstract 
Journal in Earthquake Engineering, Applied Mechanics Review, Applied Science & 
Technology Index, Chemical Abstracts, Computer Abstracts, Current Contents/ 
Engineering, Technology & Applied Sciences, Current Index to Statistics, Current 
Papers in Electrical & Electronic Engineering, Current Papers on Computers & 
Control, Electronics & Communications Abstracts Journal, The Engineering Index, 
International Aerospace Abstracts, Journal of Current Laser Abstracts, Language 
and Language Behavior Abstracts, Mathematical Reviews, Science Abstracts (Series 
A, Physics Abstracts; Series B, Electrical and Electronic Abstracts: and Series C, 
Computer & Contro! Abstracts), Science Citation Index, Sociological Abstracts, 
Social Welfare, Social Planning and Social Development, and Solid State Abstracts 
Journal. Reproductions of the Journal by years are available in microform from 
University Microfilms, 300 N. Zeeb Road, Ann Arbor, Michigan 48106. 


